An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review-Reference-Cited by-同舟云学术

An Assessment of the Predictive Performance of Current Machine Learning–Based Breast Cancer Risk Prediction Models: Systematic Review

Published:2022-12-29 Issue:12 Volume:8 Page:e35750
ISSN:2369-2960
Container-title:JMIR Public Health and Surveillance
language:en
Short-container-title:JMIR Public Health Surveill

Author:

Gao Ying^ORCID,Li Shu^ORCID,Jin Yujing^ORCID,Zhou Lengxiao^ORCID,Sun Shaomei^ORCID,Xu Xiaoqian^ORCID,Li Shuqian^ORCID,Yang Hongxi^ORCID,Zhang Qing^ORCID,Wang Yaogang^ORCID

Abstract

Background Several studies have explored the predictive performance of machine learning–based breast cancer risk prediction models and have shown controversial conclusions. Thus, the performance of the current machine learning–based breast cancer risk prediction models and their benefits and weakness need to be evaluated for the future development of feasible and efficient risk prediction models. Objective The aim of this review was to assess the performance and the clinical feasibility of the currently available machine learning–based breast cancer risk prediction models. Methods We searched for papers published until June 9, 2021, on machine learning–based breast cancer risk prediction models in PubMed, Embase, and Web of Science. Studies describing the development or validation models for predicting future breast cancer risk were included. The Prediction Model Risk of Bias Assessment Tool (PROBAST) was used to assess the risk of bias and the clinical applicability of the included studies. The pooled area under the curve (AUC) was calculated using the DerSimonian and Laird random-effects model. Results A total of 8 studies with 10 data sets were included. Neural network was the most common machine learning method for the development of breast cancer risk prediction models. The pooled AUC of the machine learning–based optimal risk prediction model reported in each study was 0.73 (95% CI 0.66-0.80; approximate 95% prediction interval 0.56-0.96), with a high level of heterogeneity between studies (Q=576.07, I2=98.44%; P<.001). The results of head-to-head comparison of the performance difference between the 2 types of models trained by the same data set showed that machine learning models had a slightly higher advantage than traditional risk factor–based models in predicting future breast cancer risk. The pooled AUC of the neural network–based risk prediction model was higher than that of the nonneural network–based optimal risk prediction model (0.71 vs 0.68, respectively). Subgroup analysis showed that the incorporation of imaging features in risk models resulted in a higher pooled AUC than the nonincorporation of imaging features in risk models (0.73 vs 0.61; Pheterogeneity=.001, respectively). The PROBAST analysis indicated that many machine learning models had high risk of bias and poorly reported calibration analysis. Conclusions Our review shows that the current machine learning–based breast cancer risk prediction models have some technical pitfalls and that their clinical feasibility and reliability are unsatisfactory.

Publisher

JMIR Publications Inc.

Subject

Public Health, Environmental and Occupational Health,Health Informatics

Reference39 articles.

1. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries

2. Breast-Cancer Screening — Viewpoint of the IARC Working Group

3. Impact of Screening on Breast Cancer Mortality: The UK Program 20 Years On

4. Cost effectiveness of breast cancer screening and prevention: a systematic review with a focus on risk-adapted strategies

5. Is risk-stratified breast cancer screening economically efficient in Germany?

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Development and Validation of an Interpretable Machine Learning Model for Early Prognosis Prediction in ICU Patients with Malignant Tumors and Hyperkalemia;Medicine;2024-07-26

2. Artificial Intelligence for Breast Cancer Risk Assessment;Radiologic Clinics of North America;2024-07

3. Analyzing the Performance of Explainable Machine Learning Models in Risk Factor Identification for Major Cancers (Preprint);2024-06-02

4. The construction, validation and promotion of the nomogram prognosis prediction model of UCEC, and the experimental verification of the expression and knockdown of the key gene GPX4;Heliyon;2024-01

5. Multiple Disease Prediction Using Machine Learning Techniques: A Comparative Analysis;Lecture Notes in Networks and Systems;2024