Abstract
AbstractQuery performance prediction (QPP) has been studied extensively in the IR community over the last two decades. A by-product of this research is a methodology to evaluate the effectiveness of QPP techniques. In this paper, we re-examine the existing evaluation methodology commonly used for QPP, and propose a new approach. Our key idea is to model QPP performance as a distribution instead of relying on point estimates. To obtain such distribution, we exploit the scaled Absolute Ranking Error (sARE) measure, and its mean the scaled Mean Absolute Ranking Error (sMARE). Our work demonstrates important statistical implications, and overcomes key limitations imposed by the currently used correlation-based point-estimate evaluation approaches. We also explore the potential benefits of using multiple query formulations and ANalysis Of VAriance (ANOVA) modeling in order to measure interactions between multiple factors. The resulting statistical analysis combined with a novel evaluation framework demonstrates the merits of modeling QPP performance as distributions, and enables detailed statistical ANOVA models for comparative analyses to be created.
Funder
australian research council
Publisher
Springer Science and Business Media LLC
Subject
Library and Information Sciences,Information Systems
Reference59 articles.
1. Amati, G., Carpineto, C., & Romano, G. (2004). Query difficulty, robustness, and selective application of query expansion. In Proceedings of the ECIR (pp. 127–137). Springer.
2. Aslam, J. A., & Pavlu, V. (2007). Query hardness estimation using Jensen-Shannon divergence among multiple scoring functions. In Proceedings of the ECIR (pp. 198–209). Springer.
3. Bailey, P., Moffat, A., Scholer, F., & Thomas, P. (2016) UQV100: A test collection with query variability. In Proceedings of the SIGIR (pp 725–728).
4. Bailey, P., Moffat, A., Scholer, F., & Thomas, P. (2017) Retrieval consistency in the presence of query variations. In Proceedings of the SIGIR (pp 395–404).
5. Banks, D., Over, P., & Zhang, N. F. (1999). Blind men and elephants: Six approaches to TREC data. Information Retrieval, 1(1–2), 7–34.
Cited by
11 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献