The Opposite of Smoothing: A Language Model Approach to Ranking Query-Specific Document Clusters

Published:2011-07-29 Volume:41 Page:367-395
ISSN:1076-9757
Container-title:Journal of Artificial Intelligence Research
Short-container-title:jair

Author:

Kurland O., Krikon E.

Abstract

Exploiting information induced from (query-specific) clustering of top-retrieved documents has long been proposed as a means of improving precision at the very top ranks of the returned results. We present a novel language model approach to ranking query-specific clusters by the presumed percentage of relevant documents that they contain. While most previous cluster ranking approaches focus on the cluster as a whole, our model also utilizes information induced from documents associated with the cluster. Our model substantially outperforms previous approaches for identifying clusters containing a high relevant-document percentage. Furthermore, using the model to produce a document ranking yields precision-at-top-ranks performance that is consistently better than that of the initial ranking upon which clustering is performed. The performance also compares favorably with that of a state-of-the-art pseudo-feedback-based retrieval method.

Publisher

AI Access Foundation

Subject

Artificial Intelligence

Cited by 16 articles.

1. Novel Framework for Improving the Correctness of Reference Answers to Enhance Results of ASAG Systems;SN Computer Science;2023-05-24

2. From Cluster Ranking to Document Ranking;Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval;2022-07-06

3. An End-to-End Efficient Lucene-Based Framework of Document/Information Retrieval;International Journal of Information Retrieval Research;2022-01

4. Relevance- and interface-driven clustering for visual information retrieval;Information Systems;2020-12

5. A passage-based approach to learning to rank documents;Information Retrieval Journal;2020-03-06