Risk of bias in studies on prediction models developed using supervised machine learning techniques: systematic review

Authors:

Andaur Navarro Constanza L, Damen Johanna A A, Takada Toshihiko, Nijman Steven W J, Dhiman Paula, Ma Jie, Collins Gary S, Bajpai Ram, Riley Richard D, Moons Karel G M, Hooft Lotty

Abstract

Objective: To assess the methodological quality of studies on prediction models developed using machine learning techniques across all medical specialties.

Design: Systematic review.

Data sources: PubMed from 1 January 2018 to 31 December 2019.

Eligibility criteria: Articles reporting on the development, with or without external validation, of a multivariable prediction model (diagnostic or prognostic) developed using supervised machine learning for individualised predictions. No restrictions applied to study design, data source, or predicted patient related health outcomes.

Review methods: Methodological quality of the studies was determined and risk of bias evaluated using the prediction model risk of bias assessment tool (PROBAST). This tool contains 21 signalling questions tailored to identify potential biases across four domains. Risk of bias was rated for each domain (participants, predictors, outcome, and analysis) and for each study overall.

Results: 152 studies were included: 58 (38%) reported a diagnostic prediction model and 94 (62%) a prognostic prediction model. PROBAST was applied to 152 model developments and 19 external validations. Of these 171 analyses, 148 (87%, 95% confidence interval 81% to 91%) were rated at high risk of bias. The analysis domain was most frequently rated at high risk of bias. Of the 152 models, 85 (56%, 48% to 64%) were developed with an inadequate number of events per candidate predictor, 62 (41%, 33% to 49%) handled missing data inadequately, and 59 (39%, 31% to 47%) assessed overfitting improperly. Most studies used appropriate data sources to develop (73%, 66% to 79%) and to externally validate (74%, 51% to 88%) the machine learning based prediction models. Information about blinding of outcome assessment and blinding of predictors was, however, absent for 60 (40%, 32% to 47%) and 79 (52%, 44% to 60%) of the developed models, respectively.

Conclusion: Most studies on machine learning based prediction models show poor methodological quality and are at high risk of bias. Factors contributing to risk of bias include small study size, poor handling of missing data, and failure to deal with overfitting. Efforts to improve the design, conduct, reporting, and validation of such studies are necessary to boost the application of machine learning based prediction models in clinical practice.

Systematic review registration: PROSPERO CRD42019161764.
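The most common analysis-domain problem reported above is an inadequate number of events per candidate predictor (EPV). As a minimal illustration (not taken from the paper), the EPV check can be sketched as follows; the threshold of 10 to 20 is a commonly cited rule of thumb, not a value the review itself prescribes:

```python
# Minimal sketch of the events-per-candidate-predictor (EPV) check,
# one criterion used when judging whether a study's sample size is
# adequate. Thresholds here are illustrative rules of thumb.
def events_per_predictor(n_events: int, n_candidate_predictors: int) -> float:
    """Return EPV = number of outcome events / number of candidate predictor parameters."""
    if n_candidate_predictors <= 0:
        raise ValueError("need at least one candidate predictor")
    return n_events / n_candidate_predictors

# Example: 120 outcome events and 30 candidate predictors.
epv = events_per_predictor(120, 30)
print(epv)        # 4.0
print(epv >= 10)  # False: below the commonly cited minimum of 10-20
```

Note that modern sample size guidance for prediction models goes beyond a single EPV cut-off and also accounts for outcome prevalence and the anticipated model performance; EPV is shown here only because it is the quantity the review reports.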

Publisher

BMJ

Subject

General Engineering

Cited by 176 articles.
