IR evaluation methods for retrieving highly relevant documents-Reference-Cited by-同舟云学术

IR evaluation methods for retrieving highly relevant documents

Published:2017-08-02 Issue:2 Volume:51 Page:243-250
ISSN:0163-5840
Container-title:ACM SIGIR Forum
language:en
Short-container-title:SIGIR Forum

Author:

Järvelin Kalervo¹,Kekäläinen Jaana¹

Affiliation:

1. University of Tampere, Finland

Abstract

This paper proposes evaluation methods based on the use of non-dichotomous relevance judgements in IR experiments. It is argued that evaluation methods should credit IR methods for their ability to retrieve highly relevant documents. This is desirable from the user point of view in modem large IR environments. The proposed methods are (1) a novel application of P-R curves and average precision computations based on separate recall bases for documents of different degrees of relevance, and (2) two novel measures computing the cumulative gain the user obtains by examining the retrieval result up to a given ranked position. We then demonstrate the use of these evaluation methods in a case study on the effectiveness of query types, based on combinations of query structures and expansion, in retrieving documents of various degrees of relevance. The test was run with a best match retrieval system (In- Query I) in a text database consisting of newspaper articles. The results indicate that the tested strong query structures are most effective in retrieving highly relevant documents. The differences between the query types are practically essential and statistically significant. More generally, the novel evaluation methods and the case demonstrate that non-dichotomous relevance assessments are applicable in IR experiments, may reveal interesting phenomena, and allow harder testing of IR methods.

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture,Management Information Systems

Link

https://dl.acm.org/doi/pdf/10.1145/3130348.3130374

Reference18 articles.

1. An evaluation of retrieval effectiveness for a full-text document-retrieval system

2. Measures of relative relevance and ranked half-life

3. THE EXPRESSION OF CONCEPTUAL SYNTAGMATIC RELATIONSHIPS: A COMPARATIVE SURVEY

Cited by 138 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Smart and user-centric manufacturing information recommendation using multimodal learning to support human-robot collaboration in mixed reality environments;Robotics and Computer-Integrated Manufacturing;2025-02

2. LegalAsst: Human-centered and AI-empowered machine to enhance court productivity and legal assistance;Information Sciences;2024-09

3. Reliable Confidence Intervals for Information Retrieval Evaluation Using Generative A.I.;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24

4. Are Large Language Models Good at Utility Judgments?;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10

5. Enhancing efficiency of protein language models with minimal wet-lab data through few-shot learning;Nature Communications;2024-07-02