Electronic Medical Record–Based Machine Learning Approach to Predict the Risk of 30-Day Adverse Cardiac Events After Invasive Coronary Treatment: Machine Learning Model Development and Validation-Reference-Cited by-同舟云学术

Electronic Medical Record–Based Machine Learning Approach to Predict the Risk of 30-Day Adverse Cardiac Events After Invasive Coronary Treatment: Machine Learning Model Development and Validation

Published:2022-05-11 Issue:5 Volume:10 Page:e26801
ISSN:2291-9694
Container-title:JMIR Medical Informatics
language:en
Short-container-title:JMIR Med Inform

Author:

Kwon Osung^ORCID,Na Wonjun^ORCID,Kang Heejun^ORCID,Jun Tae Joon^ORCID,Kweon Jihoon^ORCID,Park Gyung-Min^ORCID,Cho YongHyun^ORCID,Hur Cinyoung^ORCID,Chae Jungwoo^ORCID,Kang Do-Yoon^ORCID,Lee Pil Hyung^ORCID,Ahn Jung-Min^ORCID,Park Duk-Woo^ORCID,Kang Soo-Jin^ORCID,Lee Seung-Whan^ORCID,Lee Cheol Whan^ORCID,Park Seong-Wook^ORCID,Park Seung-Jung^ORCID,Yang Dong Hyun^ORCID,Kim Young-Hak^ORCID

Abstract

Background Although there is a growing interest in prediction models based on electronic medical records (EMRs) to identify patients at risk of adverse cardiac events following invasive coronary treatment, robust models fully utilizing EMR data are limited. Objective We aimed to develop and validate machine learning (ML) models by using diverse fields of EMR to predict the risk of 30-day adverse cardiac events after percutaneous intervention or bypass surgery. Methods EMR data of 5,184,565 records of 16,793 patients at a quaternary hospital between 2006 and 2016 were categorized into static basic (eg, demographics), dynamic time-series (eg, laboratory values), and cardiac-specific data (eg, coronary angiography). The data were randomly split into training, tuning, and testing sets in a ratio of 3:1:1. Each model was evaluated with 5-fold cross-validation and with an external EMR-based cohort at a tertiary hospital. Logistic regression (LR), random forest (RF), gradient boosting machine (GBM), and feedforward neural network (FNN) algorithms were applied. The primary outcome was 30-day mortality following invasive treatment. Results GBM showed the best performance with area under the receiver operating characteristic curve (AUROC) of 0.99; RF had a similar AUROC of 0.98. AUROCs of FNN and LR were 0.96 and 0.93, respectively. GBM had the highest area under the precision-recall curve (AUPRC) of 0.80, and the AUPRCs of RF, LR, and FNN were 0.73, 0.68, and 0.63, respectively. All models showed low Brier scores of <0.1 as well as highly fitted calibration plots, indicating a good fit of the ML-based models. On external validation, the GBM model demonstrated maximal performance with an AUROC of 0.90, while FNN had an AUROC of 0.85. The AUROCs of LR and RF were slightly lower at 0.80 and 0.79, respectively. The AUPRCs of GBM, LR, and FNN were similar at 0.47, 0.43, and 0.41, respectively, while that of RF was lower at 0.33. Among the categories in the GBM model, time-series dynamic data demonstrated a high AUROC of >0.95, contributing majorly to the excellent results. Conclusions Exploiting the diverse fields of the EMR data set, the ML-based 30-day adverse cardiac event prediction models demonstrated outstanding results, and the applied framework could be generalized for various health care prediction models.

Publisher

JMIR Publications Inc.

Subject

Health Information Management,Health Informatics

Reference29 articles.

1. Heart Disease and Stroke Statistics—2019 Update: A Report From the American Heart Association

2. Cardiovascular Risk Prediction in Patients With Stable and Unstable Coronary Heart Disease

3. 2018 ESC/EACTS Guidelines on myocardial revascularization

4. Prediction of Long-Term Mortality After Percutaneous Coronary Intervention in Older Adults

5. Contemporary and evolving risk scoring algorithms for percutaneous coronary intervention

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Developing an Inpatient Electronic Medical Record Phenotype for Hospital-Acquired Pressure Injuries: Case Study Using Natural Language Processing Models;JMIR AI;2023-03-08

2. Developing an Inpatient Electronic Medical Record Phenotype for Hospital-Acquired Pressure Injuries: Case Study Using Natural Language Processing Models (Preprint);2022-07-24