Efficient query processing techniques for next-page retrieval-Reference-Cited by-同舟云学术

Efficient query processing techniques for next-page retrieval

Published:2022-01-18 Issue:1 Volume:25 Page:27-43
ISSN:1386-4564
Container-title:Information Retrieval Journal
language:en
Short-container-title:Inf Retrieval J

Author:

Mackenzie Joel^ORCID,Petri Matthias,Moffat Alistair

Abstract

AbstractIn top-k ranked retrieval the goal is to efficiently compute an ordered list of the highest scoring k documents according to some stipulated similarity function such as the well-known BM25 approach. In most implementation techniques a min-heap of size k is used to track the top scoring candidates. In this work we consider the question of how best to retrieve the second page of search results, given that a first page has already been computed; that is, identification of the documents at ranks

$$k+1$$

k + 1 to 2k for some query. Our goal is to understand what information is available as a by-product of the first-page scoring, and how it can be employed to accelerate the second-page computation, assuming that the second-page of results is required for only a fraction of the query load. We propose a range of simple, yet efficient, next-page retrieval techniques which are suitable for accelerating Document-at-a-Time mechanisms, and demonstrate their performance on three large text collections.

Funder

Australian Research Council

University of Melbourne

Publisher

Springer Science and Business Media LLC

Subject

Library and Information Sciences,Information Systems

Link

https://link.springer.com/content/pdf/10.1007/s10791-021-09402-7.pdf

Reference43 articles.

1. Allan, J., Carterette, B., Aslam, J. A., Pavlu, V., Dachev, B., & Kanoulas, E. (2007). Million query track 2007 overview. In Proceedings of the text retrieval conference (TREC).

2. Allan, J., Aslam, J. A., Carterette, B., Pavlu, V., & Kanoulas, E. (2008). Million query track 2008 overview. In Proceedings of the text retrieval conference (TREC).

3. Azzopardi, L., & Zuccon, G. (2016). Two scrolls or one click: A cost model for browsing search results. In Proceedings of the European conference on information retrieval (ECIR) (pp. 696–702).

4. Broder, A. Z., Carmel, D., Herscovici, M., Soffer, A., & Zien, J. (2003). Efficient query evaluation using a two-level retrieval process. In Proceedings of the ACM international conference on information and knowledge management (CIKM) (pp. 426–434).

5. Carterette, B., Pavlu, V., Fang, H., & Kanoulas, E. (2009). Million query track 2009 overview. In Proceedings of the text retrieval conference (TREC).

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Many are Better than One: Algorithm Selection for Faster Top-K Retrieval;Information Processing & Management;2023-07

2. Managing and Retrieving Bilingual Documents Using Artificial Intelligence-Based Ontological Framework;Computational Intelligence and Neuroscience;2022-08-25

3. Faster Learned Sparse Retrieval with Guided Traversal;Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval;2022-07-06