About learning models with multiple query-dependent features

Published:2013-07 Issue:3 Volume:31 Page:1-39
ISSN:1046-8188
Container-title:ACM Transactions on Information Systems
language:en
Short-container-title:ACM Trans. Inf. Syst.

Author:

Macdonald Craig¹, Santos Rodrygo L.T.¹, Ounis Iadh¹, He Ben²

Affiliation:

1. University of Glasgow, Glasgow, Scotland, U.K.

2. University of Chinese Academy of Sciences

Abstract

Several questions remain unanswered by the existing literature concerning the deployment of query-dependent features within learning to rank. In this work, we investigate three research questions in order to empirically ascertain best practices for learning-to-rank deployments. (i) Previous work in data fusion that pre-dates learning to rank showed that while different retrieval systems could be effectively combined, the combination of multiple models within the same system was not as effective. In contrast, the existing learning-to-rank datasets (e.g., LETOR) often deploy multiple weighting models as query-dependent features within a single system, raising the question as to whether such a combination is needed. (ii) Next, we investigate whether the training of weighting model parameters, traditionally required for effective retrieval, is necessary within a learning-to-rank context. (iii) Finally, we note that existing learning-to-rank datasets use weighting model features calculated on different fields (e.g., title, content, or anchor text), even though such weighting models have been criticized in the literature. Experiments addressing these three questions are conducted on Web search datasets, using various weighting models as query-dependent and typical query-independent features, which are combined using three learning-to-rank techniques. In particular, we show and explain why multiple weighting models should be deployed as features. Moreover, we unexpectedly find that training the weighting model's parameters degrades the learned model's effectiveness. Finally, we show that computing a weighting model separately for each field is less effective than more theoretically sound field-based weighting models.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications; General Business, Management and Accounting; Information Systems

Link

https://dl.acm.org/doi/pdf/10.1145/2493175.2493176

Reference: 63 articles.

1. Fusion of effective retrieval strategies in the same information retrieval system

2. Quality-biased ranking of web documents

3. Efficient query evaluation using a two-level retrieval process

Cited by 29 articles.

1. A set of novel HTML document quality features for Web information retrieval: Including applications to learning to rank for information retrieval;Expert Systems with Applications;2024-07

2. Selective Query Processing: A Risk-Sensitive Selection of Search Configurations;ACM Transactions on Information Systems;2023-08-21

3. How to build high quality L2R training data: Unsupervised compression-based selective sampling for learning to rank;Information Sciences;2022-07

4. Defining an Optimal Configuration Set for Selective Search Strategy - A Risk-Sensitive Approach;Proceedings of the 30th ACM International Conference on Information & Knowledge Management;2021-10-26

5. Feature Extraction for Large-Scale Text Collections;Proceedings of the 29th ACM International Conference on Information & Knowledge Management;2020-10-19