A question-answering framework for automated abstract screening using large language models

Published:2024-07-23 Issue:9 Volume:31 Page:1939-1952
ISSN:1067-5027
Container-title:Journal of the American Medical Informatics Association
language:en

Author:

Akinseloyin Opeoluwa¹, Jiang Xiaorui², Palade Vasile¹

Affiliation:

1. Centre for Computational Science and Mathematical Modelling, Coventry University, Coventry CV1 2TT, United Kingdom

2. Information School, The University of Sheffield, Sheffield S10 2AH, United Kingdom

Abstract

Objective: This paper aims to address the challenges in abstract screening within systematic reviews (SR) by leveraging the zero-shot capabilities of large language models (LLMs).

Methods: We employ an LLM to prioritize candidate studies by aligning abstracts with the selection criteria outlined in an SR protocol. Abstract screening was transformed into a novel question-answering (QA) framework, treating each selection criterion as a question addressed by the LLM. The framework involves breaking down the selection criteria into multiple questions, properly prompting the LLM to answer each question, scoring and re-ranking each answer, and combining the responses to make nuanced inclusion or exclusion decisions.

Results and Discussion: Large-scale validation was performed on the benchmark of CLEF eHealth 2019 Task 2: Technology-Assisted Reviews in Empirical Medicine. Focusing on GPT-3.5 as a case study, the proposed QA framework consistently exhibited a clear advantage over traditional information retrieval approaches and bespoke BERT-family models that were fine-tuned for prioritizing candidate studies (ie, from BERT to PubMedBERT) across 31 datasets of 4 categories of SRs, underscoring their high potential in facilitating abstract screening. The experiments also showcased the viability of using selection criteria as a query for reference prioritization and the viability of the framework using different LLMs.

Conclusion: The investigation justified the indispensable value of leveraging selection criteria to improve the performance of automated abstract screening. LLMs demonstrated proficiency in prioritizing candidate studies for abstract screening using the proposed QA framework. Significant performance improvements were obtained by re-ranking answers using the semantic alignment between abstracts and selection criteria. This further highlighted the pertinence of utilizing selection criteria to enhance abstract screening.
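The pipeline described in the abstract (answer one question per selection criterion, score and re-rank each answer, then combine the results to prioritize candidates) could be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the answer-to-score mapping, the linear blend of answer score and similarity, the averaging rule, and all names (`ANSWER_SCORES`, `CriterionAnswer`, `rerank_score`, `abstract_score`, `prioritize`) are assumptions. The LLM answers and the abstract-criterion similarity values are assumed to have been computed elsewhere.

```python
from dataclasses import dataclass

# Hypothetical mapping from a categorical LLM answer to a numeric score.
ANSWER_SCORES = {"yes": 1.0, "unsure": 0.5, "no": 0.0}


@dataclass
class CriterionAnswer:
    question: str      # one selection criterion phrased as a question
    answer: str        # assumed LLM answer: "yes", "no", or "unsure"
    similarity: float  # assumed abstract-criterion similarity in [0, 1]


def rerank_score(item: CriterionAnswer, weight: float = 0.5) -> float:
    """Blend the answer score with the similarity (assumed linear blend)."""
    base = ANSWER_SCORES[item.answer.lower()]
    return (1.0 - weight) * base + weight * item.similarity


def abstract_score(answers: list[CriterionAnswer], weight: float = 0.5) -> float:
    """Combine per-criterion scores by simple averaging (an assumption)."""
    if not answers:
        return 0.0
    return sum(rerank_score(a, weight) for a in answers) / len(answers)


def prioritize(candidates: dict[str, list[CriterionAnswer]],
               weight: float = 0.5) -> list[str]:
    """Return candidate IDs ordered from most to least likely to be included."""
    return sorted(candidates,
                  key=lambda cid: abstract_score(candidates[cid], weight),
                  reverse=True)


# Toy data: the answers and similarities are made up for illustration.
candidates = {
    "study_A": [
        CriterionAnswer("Is the study a randomized trial?", "yes", 0.8),
        CriterionAnswer("Does it involve adult patients?", "yes", 0.6),
    ],
    "study_B": [
        CriterionAnswer("Is the study a randomized trial?", "no", 0.2),
        CriterionAnswer("Does it involve adult patients?", "unsure", 0.4),
    ],
}
ranking = prioritize(candidates)
```

Setting `weight` to 0 in this sketch ranks by the answers alone, while larger values give the similarity term more influence on the final order.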

Funder

Coventry University

National Planning Office of Philosophy and Social Science of China

International Exchange Scheme

Royal Society of the United Kingdom

Research Excellence Development Framework award of Coventry University

Publisher

Oxford University Press (OUP)

Link

https://academic.oup.com/jamia/article-pdf/31/9/1939/58868008/ocae166.pdf

References: 73 articles.

1. Systematic review automation technologies;Tsafnat;Syst Rev,2014

2. Systematic reviews and meta-analysis: understanding the best evidence in primary healthcare;Gopalakrishnan;J Family Med Prim Care,2013

3. The rationale behind systematic reviews in clinical medicine: a conceptual framework;Moosapour;J Diabetes Metab Disord,2021

4. Use of cost-effectiveness analysis to compare the efficiency of study identification methods in systematic reviews;Shemilt;Syst Rev,2016

5. The significant cost of systematic reviews and meta-analyses: a call for greater involvement of machine learning to assess the promise of clinical trials;Michelson;Contemp Clin Trials Commun,2019

Cited by 1 article.

1. Large language models in biomedicine and health: current research landscape and future directions;Journal of the American Medical Informatics Association;2024-08-22