Text mining to support abstract screening for knowledge syntheses: a semi-automated workflow-Reference-Cited by-同舟云学术

Text mining to support abstract screening for knowledge syntheses: a semi-automated workflow

Published:2021-05-26 Issue:1 Volume:10 Page:
ISSN:2046-4053
Container-title:Systematic Reviews
language:en
Short-container-title:Syst Rev

Author:

Pham Ba’,Jovanovic Jelena,Bagheri Ebrahim,Antony Jesmin,Ashoor Huda,Nguyen Tam T.,Rios Patricia,Robson Reid,Thomas Sonia M.,Watt Jennifer,Straus Sharon E.,Tricco Andrea C.^ORCID

Abstract

Abstract Background Current text mining tools supporting abstract screening in systematic reviews are not widely used, in part because they lack sensitivity and precision. We set out to develop an accessible, semi-automated “workflow” to conduct abstract screening for systematic reviews and other knowledge synthesis methods. Methods We adopt widely recommended text-mining and machine-learning methods to (1) process title-abstracts into numerical training data; and (2) train a classification model to predict eligible abstracts. The predicted abstracts are screened by human reviewers for (“true”) eligibility, and the newly eligible abstracts are used to identify similar abstracts, using near-neighbor methods, which are also screened. These abstracts, as well as their eligibility results, are used to update the classification model, and the above steps are iterated until no new eligible abstracts are identified. The workflow was implemented in R and evaluated using a systematic review of insulin formulations for type-1 diabetes (14,314 abstracts) and a scoping review of knowledge-synthesis methods (17,200 abstracts). Workflow performance was evaluated against the recommended practice of screening abstracts by 2 reviewers, independently. Standard measures were examined: sensitivity (inclusion of all truly eligible abstracts), specificity (exclusion of all truly ineligible abstracts), precision (inclusion of all truly eligible abstracts among all abstracts screened as eligible), F1-score (harmonic average of sensitivity and precision), and accuracy (correctly predicted eligible or ineligible abstracts). Workload reduction was measured as the hours the workflow saved, given only a subset of abstracts needed human screening. Results With respect to the systematic and scoping reviews respectively, the workflow attained 88%/89% sensitivity, 99%/99% specificity, 71%/72% precision, an F1-score of 79%/79%, 98%/97% accuracy, 63%/55% workload reduction, with 12%/11% fewer abstracts for full-text retrieval and screening, and 0%/1.5% missed studies in the completed reviews. Conclusion The workflow was a sensitive, precise, and efficient alternative to the recommended practice of screening abstracts with 2 reviewers. All eligible studies were identified in the first case, while 6 studies (1.5%) were missed in the second that would likely not impact the review’s conclusions. We have described the workflow in language accessible to reviewers with limited exposure to natural language processing and machine learning, and have made the code available to reviewers.

Funder

Ontario Ministry of Research, Innovation and Science

Canada Excellence Research Chairs, Government of Canada

Natural Sciences and Engineering Research Council of Canada

Publisher

Springer Science and Business Media LLC

Subject

Medicine (miscellaneous)

Link

https://link.springer.com/content/pdf/10.1186/s13643-021-01700-x.pdf

Reference51 articles.

1. Higgins J, Green S. Cochrane handbook for systematic reviews of interventions Version 5.1.0. The Cochrane Collaboration; 2011.

2. Allen IE, Olkin I. Estimating time to conduct a meta-analysis from number of citations retrieved. Jama. 1999;282(7):634–5. https://doi.org/10.1001/jama.282.7.634.

3. Borah R, Brown AW, Capers PL, Kaiser KA. Analysis of the time and workers needed to conduct systematic reviews of medical interventions using data from the PROSPERO registry. BMJ Open. 2017;7:e012545.

4. Petticrew M, Roberts H. Systematic reviews in the social sciences: A practical guide. Malden: Blackwell Publishing Co.; 2006. https://doi.org/10.1002/9780470754887.

5. O'Connor AM, Tsafnat G, Gilbert SB, Thayer KA, Wolfe MS. Moving toward the automation of the systematic review process: a summary of discussions at the second meeting of International Collaboration for the Automation of Systematic Reviews (ICASR). Syst Rev. 2018;7(1):3. https://doi.org/10.1186/s13643-017-0667-4.

Cited by 31 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Human-Comparable Sensitivity of Large Language Models in Identifying Eligible Studies Through Title and Abstract Screening: 3-Layer Strategy Using GPT-3.5 and GPT-4 for Systematic Reviews;Journal of Medical Internet Research;2024-08-16

2. Machine learning enables automated screening for systematic reviews and meta-analysis in urology;World Journal of Urology;2024-07-10

3. Automation of systematic reviews of biomedical literature: a scoping review of studies indexed in PubMed;Systematic Reviews;2024-07-08

4. Literature Filtering for Systematic Reviews with Transformers;2024 2nd International Conference on Communications, Computing and Artificial Intelligence;2024-06-21

5. Title and abstract screening for literature reviews using large language models: an exploratory study in the biomedical domain;Systematic Reviews;2024-06-15