Affiliation:
1. University College Dublin, Belfield, Dublin, Ireland
2. University of Glasgow, Glasgow, Scotland, United Kingdom
3. Indian Statistical Institute, Dunlop, Kolkata, West Bengal, India
Abstract
Query performance prediction (QPP) methods, which aim to predict the performance of a query, often rely on evidences in the form of different characteristic patterns in the distribution of Retrieval Status Values (RSVs). However, for neural IR models, it is usually observed that the RSVs are often less reliable for QPP because they are bounded within short intervals, different from the situation for statistical models. To address this limitation, we propose a model-agnostic QPP framework that gathers additional evidences by leveraging information from the characteristic patterns of RSV distributions computed over a set of
automatically generated
query variants, relative to that of the current query. Specifically, the idea behind our proposed method—Weighted Relative Information Gain (WRIG), is that a substantial relative decrease or increase in the standard deviation of the RSVs of the query variants is likely to be a relative indicator of how easy or difficult the original query is. To cater for the absence of human-annotated query variants in real-world scenarios, we further propose an automatic query variant generation method. This can produce variants in a controlled manner by substituting terms from the original query with new ones sampled from a weighted distribution, constructed either via a relevance model or with the help of an embedded representation of query terms. Our experiments on the TREC-Robust, ClueWeb09B, and MS MARCO datasets show that WRIG, by the use of this relative changes in QPP estimate, leads to significantly better results than a state-of-the-art baseline method that leverages information from (manually created) query variants by the application of additive smoothing [
64
]. The results also show that our approach can improve the QPP effectiveness of neural retrieval approaches in particular.
Funder
Science Foundation Ireland
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Science Applications,General Business, Management and Accounting,Information Systems
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. "In-Context Learning" or: How I learned to stop worrying and love "Applied Information Retrieval";Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10
2. Optimization Methods for Personalizing Large Language Models through Retrieval Augmentation;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10
3. Query Performance Prediction: From Fundamentals to Advanced Techniques;Lecture Notes in Computer Science;2024
4. BertPE: A BERT-Based Pre-retrieval Estimator for Query Performance Prediction;Lecture Notes in Computer Science;2024
5. Noisy Perturbations for Estimating Query Difficulty in Dense Retrievers;Proceedings of the 32nd ACM International Conference on Information and Knowledge Management;2023-10-21