Loading...
Please wait, while we are loading the content...
Similar Documents
A hierarchical online classifier for patent categorization
| Content Provider | Semantic Scholar |
|---|---|
| Author | Tikk, Domonkos Biró, György Törcsvári, A. |
| Copyright Year | 2007 |
| Abstract | Patent categorization (PC) is a typical application area of text categorization (TC). TC can be applied in different scenarios at the work of patent offices depending on at what stage the categorization is needed. This is a challenging field for TC algorithms, since the applications have to deal simultaneously with large number of categories (in the magnitude of 1000–10000) organized in hierarchy, large number of long documents with huge vocabularies at training, and they are required to work fast and accurate at on-the-fly categorization. In this paper we present a hierarchical online classifier, called HITEC, which meets the above requirements. The novelty of the method lies in the taxonomy dependent architecture of the classifier, the applied weight updating scheme, and in the relaxed category selection method. We evaluate the presented method on two large English patent application databases, the WIPO-alpha and the Espace A/B corpora. We also compare the method to other TC algorithms on these collections, and show that it outperforms them significantly. |
| Starting Page | 244 |
| Ending Page | 267 |
| Page Count | 24 |
| File Format | PDF HTM / HTML |
| DOI | 10.4018/978-1-59904-373-9.ch012 |
| Alternate Webpage(s) | http://www.igi-global.com/viewtitlesample.aspx?id=10185&ptid=339&t=a+hierarchical+online+classifier+for+patent+categorization |
| Alternate Webpage(s) | https://doi.org/10.4018/978-1-59904-373-9.ch012 |
| Language | English |
| Access Restriction | Open |
| Content Type | Text |
| Resource Type | Article |