Loading...
Please wait, while we are loading the content...
Similar Documents
Similarity Search on the Web: Evaluation and Scalability Considerations
| Content Provider | Semantic Scholar |
|---|---|
| Author | Haveliwala, Taher H. Gionis, Aristides Klein, Daniel Indyk, Piotr |
| Copyright Year | 2001 |
| Abstract | Allowing users to find pages on the web similar to a particular query page is a crucial component of modern search engines. A variety of techniques and approaches exist to support "Related Pages" queries. In this paper we discuss shortcomings of previous approaches and present a unifying approach that puts special emphasis on the use of text, both within anchors and surrounding anchors. In the central contribution of our paper, we present a novel technique for automating the evaluation process, allowing us to tune our parameters to maximize the quality of the results. Finally, we show how to scale our approach to millions of web pages, using the established Locality-Sensitive-Hashing technique. |
| File Format | PDF HTM / HTML |
| Alternate Webpage(s) | http://ilpubs.stanford.edu:8090/526/1/2001-8.pdf |
| Language | English |
| Access Restriction | Open |
| Content Type | Text |
| Resource Type | Article |