Comparison of discrimination and calibration performance of ECG-based machine learning models for prediction of new-onset atrial fibrillation-Reference-Cited by-同舟云学术

Comparison of discrimination and calibration performance of ECG-based machine learning models for prediction of new-onset atrial fibrillation

Published:2023-07-22 Issue:1 Volume:23 Page:
ISSN:1471-2288
Container-title:BMC Medical Research Methodology
language:en
Short-container-title:BMC Med Res Methodol

Author:

Baj Giovanni,Gandin Ilaria,Scagnetto Arjuna,Bortolussi Luca,Cappelletto Chiara,Di Lenarda Andrea,Barbati Giulia

Abstract

AbstractBackgroundMachine learning (ML) methods to build prediction models starting from electrocardiogram (ECG) signals are an emerging research field. The aim of the present study is to investigate the performances of two ML approaches based on ECGs for the prediction of new-onset atrial fibrillation (AF), in terms of discrimination, calibration and sample size dependence.MethodsWe trained two models to predict new-onset AF: a convolutional neural network (CNN), that takes as input the raw ECG signals, and an eXtreme Gradient Boosting model (XGB), that uses the signal’s extracted features. A penalized logistic regression model (LR) was used as a benchmark. Discrimination was evaluated with the area under the ROC curve, while calibration with the integrated calibration index. We investigated the dependence of models’ performances on the sample size and on class imbalance corrections introduced with random under-sampling.ResultsCNN's discrimination was the most affected by the sample size, outperforming XGB and LR only aroundn = 10.000 observations. Calibration showed only a small dependence on the sample size for all the models considered.Balancing the training set with random undersampling did not improve discrimination in any of the models. Instead, the main effect of imbalance corrections was to worsen the models’ calibration (for CNN, integrated calibration index from 0.014 [0.01, 0.018] to 0.17 [0.16, 0.19]).The sample size emerged as a fundamental point for developing the CNN model, especially in terms of discrimination (AUC = 0.75 [0.73, 0.77] whenn = 10.000, AUC = 0.80 [0.79, 0.81] whenn = 150.000). The effect of the sample size on the other two models was weaker. Imbalance corrections led to poorly calibrated models, for all the approaches considered, reducing the clinical utility of the models.ConclusionsOur results suggest that the choice of approach in the analysis of ECG should be based on the amount of data available, preferring more standard models for small datasets. Moreover, imbalance correction methods should be avoided when developing clinical prediction models, where calibration is crucial.

Publisher

Springer Science and Business Media LLC

Subject

Health Informatics,Epidemiology

Link

https://link.springer.com/content/pdf/10.1186/s12874-023-01989-3.pdf

Reference39 articles.

1. Mincholé A, Camps J, Lyon A, Rodríguez B. Machine learning in the electrocardiogram. J Electrocardiol. 2019;57:S61–4. https://doi.org/10.1016/j.jelectrocard.2019.08.008.

2. Siontis KC, Noseworthy PA, Attia ZI, Friedman PA. Artificial intelligence-enhanced electrocardiography in cardiovascular disease management. Nat Rev Cardiol. 2021;18(7):465–78. https://doi.org/10.1038/s41569-020-00503-2.

3. Alonso A, et al. Simple risk model predicts incidence of atrial fibrillation in a racially and geographically diverse population: the CHARGE-AF Consortium. J Am Heart Assoc. 2013;2(2):e000102.

4. Wesselius FJ, van Schie MS, De Groot NMS, Hendriks RC. Digital biomarkers and algorithms for detection of atrial fibrillation using surface electrocardiograms: a systematic review. Comput Biol Med. 2021;133:104404.

5. Bouzid Z, et al. Novel ECG features and machine learning to optimize culprit lesion detection in patients with suspected acute coronary syndrome. J Electrocardiol. 2021;69:31–7. https://doi.org/10.1016/j.jelectrocard.2021.07.012.

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Fed-CL- an atrial fibrillation prediction system using ECG signals employing federated learning mechanism;Scientific Reports;2024-09-09

2. AI-Enhanced ECG Applications in Cardiology: Comprehensive Insights from the Current Literature with a Focus on COVID-19 and Multiple Cardiovascular Conditions;Diagnostics;2024-08-23

3. Revolutionizing Cardiology through Artificial Intelligence—Big Data from Proactive Prevention to Precise Diagnostics and Cutting-Edge Treatment—A Comprehensive Review of the Past 5 Years;Diagnostics;2024-05-26

4. Artificial intelligence for atrial fibrillation detection, prediction, and treatment: A systematic review of the last decade (2013–2023);WIREs Data Mining and Knowledge Discovery;2024-02-04