LLMscreen: A Python Package for Systematic Review Screening of Scientific Texts Using Prompt Engineering-Reference-Cited by-同舟云学术

LLMscreen: A Python Package for Systematic Review Screening of Scientific Texts Using Prompt Engineering

Published:2024-09-11 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Xia Ziqian¹^ORCID,Ye Jinquan²^ORCID,Hu Bo³,Qiang Qiqi⁴,Debnath Ramit⁵

Affiliation:

1. 1. School of Economics and Management, Tongji University

2. Nicholas School of the Environment, Duke University

3. School of Social and Behavioral Sciences, Nanjing University

4. School of Humanities and Social Science, The Chinese University of Hong Kong

5. Collective Intelligence & Design Group, University of Cambridge

Abstract

Systematic reviews represent a cornerstone of evidence-based research, yet the process is labor-intensive and time-consuming, often requiring substantial human resources. The advent of Large Language Models (LLMs) offers a novel approach to streamlining systematic reviews, particularly in the title and abstract screening phase. This study introduces a new Python package built on LLMs to accelerate this process, evaluating its performance across three datasets using distinct prompt strategies: single-prompt, k-value setting, and zero-shot. The k-value setting approach emerged as the most effective, achieving a precision of 0.649 and reducing the average error rate to 0.4%, significantly lower than the 10.76% error rate typically observed among human reviewers. Moreover, this approach enabled the screening of 3,000 papers in under 8 minutes, at a cost of only $0.30—an over 250-fold improvement in time and 2,000-fold cost efficiency compared to traditional methods. These findings underscore the potential of LLMs to enhance the efficiency and accuracy of systematic reviews, though further research is needed to address challenges related to dataset variability and model transparency. Expanding the application of LLMs to other stages of systematic reviews, such as data extraction and synthesis, could further streamline the review process, making it more comprehensive and less burdensome for researchers.

Funder

Bill and Melinda Gates Foundation

Publisher

Springer Science and Business Media LLC

Reference51 articles.

1. Down with the Hierarchies;Stegenga J;Topoi,2014

2. Analysis of the time and workers needed to conduct systematic reviews of medical interventions using data from the PROSPERO registry;Borah R;BMJ Open,2017

3. The significant cost of systematic reviews and meta-analyses: A call for greater involvement of machine learning to assess the promise of clinical trials;Michelson M;Contemp Clin Trials Commun,2019

4. Data Mining of Systematic Reviews 1934–2023: A Bibliometric Analysis: Data Mining of Systematic Reviews 1934–2023;Al-Abdallat H;High Yield Med Reviews,2023

5. Nearly 80 systematic reviews were published each day: Observational study on trends in epidemiology and reporting over the years 2000–2019;Hoffmann F;J Clin Epidemiol,2021