Machine learning algorithms for systematic review: reducing workload in a preclinical review of animal studies and reducing human screening error-Reference-Cited by-同舟云学术

Machine learning algorithms for systematic review: reducing workload in a preclinical review of animal studies and reducing human screening error

Published:2018-01-31 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Bannach-Brown Alexandra^ORCID,Przybyła Piotr,Thomas James^ORCID,Rice Andrew S.C.^ORCID,Ananiadou Sophia,Liao Jing,Macleod Malcolm Robert^ORCID

Abstract

AbstractBackgroundHere we outline a method of applying existing machine learning (ML) approaches to aid citation screening in an on-going broad and shallow systematic review of preclinical animal studies, with the aim of achieving a high performing algorithm comparable to human screening.MethodsWe applied ML approaches to a broad systematic review of animal models of depression at the citation screening stage. We tested two independently developed ML approaches which used different classification models and feature sets. We recorded the performance of the ML approaches on an unseen validation set of papers using sensitivity, specificity and accuracy. We aimed to achieve 95% sensitivity and to maximise specificity. The classification model providing the most accurate predictions was applied to the remaining unseen records in the dataset and will be used in the next stage of the preclinical biomedical sciences systematic review. We used a cross validation technique to assign ML inclusion likelihood scores to the human screened records, to identify potential errors made during the human screening process (error analysis).ResultsML approaches reached 98.7% sensitivity based on learning from a training set of 5749 records, with an inclusion prevalence of 13.2%. The highest level of specificity reached was 86%. Performance was assessed on an independent validation dataset. Human errors in the training and validation sets were successfully identified using assigned the inclusion likelihood from the ML model to highlight discrepancies. Training the ML algorithm on the corrected dataset improved the specificity of the algorithm without compromising sensitivity. Error analysis correction leads to a 3% improvement in sensitivity and specificity, which increases precision and accuracy of the ML algorithm.ConclusionsThis work has confirmed the performance and application of ML algorithms for screening in systematic reviews of preclinical animal studies. It has highlighted the novel use of ML algorithms to identify human error. This needs to be confirmed in other reviews, , but represents a promising approach to integrating human decisions and automation in systematic review methodology.

Publisher

Cold Spring Harbor Laboratory

Reference47 articles.

1. Growth rates of modern science: A bibliometric analysis based on the number of publications and cited references;Journal of the Association for Information Science and Technology,2015

2. Cohen, A. M. , Adams, C. E. , Davis, J. M. , Yu, C. , Yu, P. S. , Meng, W. , … & Smalheiser, N. R. (2010, November). Evidence-based medicine, the essential role of systematic reviews, and the need for automated text mining tools. In Proceedings of the 1st ACM international Health Informatics Symposium (pp. 376–380). ACM.

3. SWIFT-Review: a text-mining workbench for systematic review;Systematic reviews,2016

4. Systematic review automation technologies;Systematic reviews,2014

5. Using text mining for study identification in systematic reviews: a systematic review of current approaches;Systematic reviews,2015

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Systematic evidence map (SEM) template: Report format and methods used for the US EPA Integrated Risk Information System (IRIS) program, Provisional Peer Reviewed Toxicity Value (PPRTV) program, and other “fit for purpose” literature-based human health analyses;Environment International;2022-11

2. Pediatric Chronic Critical Illness: Protocol for a Scoping Review (Preprint);2021-05-27

3. Iterative guided machine learning-assisted systematic literature reviews: a diabetes case study;Systematic Reviews;2021-04-02

4. Application of systematic evidence mapping to assess the impact of new research when updating health reference values: A case example using acrolein;Environment International;2020-10

5. Machine Learning and Big Data Provide Crucial Insight for Future Biomaterials Discovery and Research;SSRN Electronic Journal;2020