Loading...
Please wait, while we are loading the content...
Similar Documents
2011b). United we fall, Divided we stand: A study of Query Segmentation and PRF for Patent Prior Art Search
| Content Provider | CiteSeerX |
|---|---|
| Author | Ganguly, Debasis Leveling, Johannes Jones, Gareth J. F. |
| Description | Previous research in patent search has shown that reducing queries by extracting a few key terms is ineffective primarily because of the vocabulary mismatch between patent applications used as queries and existing patent documents. This finding has led to the use of full patent applications as queries in patent prior art search. In ad-dition, standard information retrieval (IR) techniques such as query expansion (QE) do not work effectively with patent queries, princi-pally because of the presence of noise terms in the massive queries. In this study, we take a new approach to QE for patent search. Text segmentation is used to decompose a patent query into self-coherent sub-topic blocks. Each of these much shorted sub-topic blocks which is representative of a specific aspect or facet of the invention, is then used as a query to retrieve documents. Docu-ments retrieved using the different resulting sub-queries or query streams are interleaved to construct a final ranked list. This tech-nique can exploit the potential benefit of QE since the segmented queries are generally more focused and less ambiguous than the full patent query. Experiments on the CLEF-2010 IP prior-art search task show that the proposed method outperforms the retrieval ef-fectiveness achieved when using a single full patent application text as the query, and also demonstrates the potential benefits of QE to alleviate the vocabulary mismatch problem in patent search. |
| File Format | |
| Language | English |
| Publisher Institution | In Proceedings of the Patent Information Retrieval Workshop, CIKM 2011. ACM |
| Access Restriction | Open |
| Content Type | Text |
| Resource Type | Article |