Affiliation:
1. Stanford University, Department of Computer Science, Margaret Jacks Hall, Stanford, CA
Abstract
A common class of existing information retrieval system provides access to abstracts. For example Stanford University, through its FOLIO system, provides access to the INSPECT database of abstracts of the literature on physics, computer science, electrical engineering, etc. In this paper this database is studied by using a trace-driven simulation. We focus on physical index design, inverted index caching, and database scaling in a distributed shared-nothing system. All three issues are shown to have a strong effect on response time and throughput. Database scaling is explored in two ways. One way assumes an “optimal” configuration for a single host and then linearly scales the database by duplicating the host architecture as needed. The second way determines the optimal number of hosts given a fixed database size.
Publisher
Association for Computing Machinery (ACM)
Subject
Information Systems,Software
Reference12 articles.
1. Parallel text searching in serial files using a processor farm
2. P. A. Erarath. Page Indzzing Jot Teztual Inyorma#ion Retrieval Systems. PhD thesis University of illinois at Urbane-Champaign October 1983. P. A. Erarath. Page Indzzing Jot Teztual Inyorma#ion Retrieval Systems. PhD thesis University of illinois at Urbane-Champaign October 1983.
3. C. Faloutsos. Access methods for text. A CM Computing Sur#Je!/s 17:50-74 1985. 10.1145/4078.4080 C. Faloutsos. Access methods for text. A CM Computing Sur#Je!/s 17:50-74 1985. 10.1145/4078.4080
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Maximizing Bigdata Retrieval: Block as a Value for NoSQL over SQL;2022 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM);2022-11-10
2. Resource-Efficient Index Shard Replication in Large Scale Search Engines;IEEE Transactions on Parallel and Distributed Systems;2019-12-01
3. Index Shard Replication Strategies for Improving Resource Utilization in Large Scale Search Engines;Proceedings of the 47th International Conference on Parallel Processing;2018-08-13
4. Efficient distributed selective search;Information Retrieval Journal;2016-11-25
5. Scalability Challenges in Web Search Engines;Synthesis Lectures on Information Concepts, Retrieval, and Services;2015-12-29