Loading...
Please wait, while we are loading the content...
Similar Documents
Distributed information retrieval with skewed database size distributions (2004)
| Content Provider | CiteSeerX |
|---|---|
| Author | Si, Luo Lu, Jie Callan, Jamie |
| Description | In Proceedings of the NSF's National Conference on Digital Government Research (dg.o2003 |
| Abstract | The proliferation of government information on local area networks and the Internet creates the problem of finding information that may be distributed among many disjoint text databases (distributed information retrieval or federated search). A distributed information retrieval system is composed of three components: Resource representation, resource selection and result merging. Previous research suggested that the CORI algorithm is one of the most effective resource selection algorithms, but its effectiveness in environments containing a wide range of database sizes was not studied thoroughly. This paper shows that the CORI algorithm does not work well in environments with a skewed distribution of database sizes. We present a new resource selection algorithm based on estimating the distribution of relevant documents among the online databases. This new algorithm selects resources more accurately than the CORI algorithm, which can lead to improved document rankings. 1. |
| File Format | |
| Publisher Date | 2004-01-01 |
| Access Restriction | Open |
| Subject Keyword | Effective Resource Selection Algorithm Database Size Distributed Information Retrieval System Many Disjoint Text Database Government Information Cori Algorithm Skewed Distribution Resource Representation Result Merging New Resource Selection Algorithm Skewed Database Size Distribution Document Ranking New Algorithm Selects Resource Resource Selection |
| Content Type | Text |
| Resource Type | Proceeding Conference Proceedings Article |