Loading...
Please wait, while we are loading the content...
Similar Documents
Caching and Database Scaling in Distributed Shared-Nothing Information Retrieval Systems (1993)
| Content Provider | CiteSeerX |
|---|---|
| Author | Tomasic, Anthony Garcia-Molina, Hector |
| Abstract | A common class of existing information retrieval system provides access to abstracts. For example Stanford University, through its FOLIO system, provides access to the INSPEC database of abstracts of the literature on physics, computer science, electrical engineering, etc. In this paper this database is studied by using a trace-driven simulation. We focus on physical index design, inverted index caching, and database scaling in a distributed shared-nothing system. All three issues are shown to have a strong effect on response time and throughput. Database scaling is explored in two ways. One way assumes an "optimal" configuration for a single host and then linearly scales the database by duplicating the host architecture as needed. The second way determines the optimal number of hosts given a fixed database size. 1 Introduction Information retrieval systems, of the type found in libraries, provide indexed access to the abstracts of documents. Information vendors such as Dialog and BR... |
| File Format | |
| Language | English |
| Publisher Date | 1993-01-01 |
| Publisher Institution | Proc. ACM SIGMOD Conf |
| Access Restriction | Open |
| Subject Keyword | Database Scaling Distributed Shared-nothing Information Retrieval System Strong Effect Shared-nothing System Indexed Access Inverted Index Caching Information Retrieval System Single Host Folio System Second Way Introduction Information Retrieval System Computer Science Response Time Trace-driven Simulation Fixed Database Size Example Stanford University Information Vendor Inspec Database Optimal Configuration Host Architecture Common Class Optimal Number Electrical Engineering Physical Index Design |
| Content Type | Text |
| Resource Type | Technical Report |