When to Stop Reviewing in Technology-Assisted Reviews-Reference-Cited by-同舟云学术

When to Stop Reviewing in Technology-Assisted Reviews

Published:2020-10-31 Issue:4 Volume:38 Page:1-36
ISSN:1046-8188
Container-title:ACM Transactions on Information Systems
language:en
Short-container-title:ACM Trans. Inf. Syst.

Author:

Li Dan¹^ORCID,Kanoulas Evangelos¹

Affiliation:

1. University of Amsterdam, Amsterdam, The Netherlands

Abstract

Technology-Assisted Reviews (TAR) aim to expedite document reviewing (e.g., medical articles or legal documents) by iteratively incorporating machine learning algorithms and human feedback on document relevance. Continuous Active Learning (CAL) algorithms have demonstrated superior performance compared to other methods in efficiently identifying relevant documents. One of the key challenges for CAL algorithms is deciding when to stop displaying documents to reviewers. Existing work either lacks transparency—it provides an ad-hoc stopping point, without indicating how many relevant documents are still not found, or lacks efficiency by paying an extra cost to estimate the total number of relevant documents in the collection prior to the actual review. In this article, we handle the problem of deciding the stopping point of TAR under the continuous active learning framework by jointly training a ranking model to rank documents, and by conducting a “greedy” sampling to estimate the total number of relevant documents in the collection. We prove the unbiasedness of the proposed estimators under a with-replacement sampling design, while experimental results demonstrate that the proposed approach, similar to CAL, effectively retrieves relevant documents; but it also provides a transparent, accurate, and effective stopping point.

Funder

Dutch Research Council

China Scholarship Council

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications,General Business, Management and Accounting,Information Systems

Link

https://dl.acm.org/doi/pdf/10.1145/3411755

Reference40 articles.

1. Where to stop reading a ranked list?

2. A unified model for metasearch, pooling, and system evaluation

3. Measure-based metasearch

4. A statistical method for system evaluation using incomplete judgments

Cited by 24 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. RLStop: A Reinforcement Learning Stopping Method for TAR;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10

2. Contextualization with SPLADE for High Recall Retrieval;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10

3. Third Workshop on Augmented Intelligence in Technology-Assisted Review Systems (ALTARS);Lecture Notes in Computer Science;2024

4. Comparison of Tools and Methods for Technology-Assisted Review;Communications in Computer and Information Science;2024

5. Stopping Methods for Technology-assisted Reviews Based on Point Processes;ACM Transactions on Information Systems;2023-12-29