CovRNN—A recurrent neural network model for predicting outcomes of COVID-19 patients: model development and validation using EHR data

Published: 2021-09-29

Author:

Rasmy Laila, Nigo Masayuki, Kannadath Bijun Sai, Xie Ziqian, Mao Bingyu, Patel Khush, Zhou Yujia, Zhang Wanheng, Ross Angela, Xu Hua, Zhi Degui

Abstract

Background: Predicting outcomes of COVID-19 patients at an early stage is critical for optimized clinical care and resource management, especially during a pandemic. Although multiple machine learning models have been proposed to address this issue, they require extensive data pre-processing and feature engineering, and as a result they have not been validated or implemented outside of the original study site.

Methods: In this study, we propose CovRNN, a set of recurrent neural network (RNN)-based models that predict COVID-19 patients' outcomes from the electronic health record (EHR) data available on admission, without the need for specific feature selection or missing data imputation. CovRNN is designed to predict three outcomes: in-hospital mortality, need for mechanical ventilation, and long length of stay (LOS > 7 days). Predictions are produced as time-to-event risk scores (survival prediction) and all-time risk scores (binary prediction). Our models were trained and validated using heterogeneous, de-identified data of 247,960 COVID-19 patients from 87 healthcare systems, derived from the Cerner® Real-World Dataset (CRWD). External validation was performed using three test sets (approximately 53,000 patients). Further, the transferability of CovRNN was validated using de-identified data of 36,140 patients derived from the Optum® de-identified COVID-19 Electronic Health Record v. 1015 dataset (2007–2020).

Findings: CovRNN outperformed traditional models. It achieved an area under the receiver operating characteristic curve (AUROC) of 93% for mortality and mechanical ventilation predictions on the CRWD test set (vs. 91.5% and 90% for light gradient boosting machine (LGBM) and logistic regression (LR), respectively) and 86.5% for prediction of LOS > 7 days (vs. 81.7% and 80% for LGBM and LR, respectively). For survival prediction, CovRNN achieved a C-index of 86% for mortality and 92.6% for mechanical ventilation. External validation confirmed AUROCs in similar ranges.

Interpretation: Trained on a large, heterogeneous real-world dataset, our CovRNN model showed high prediction accuracy, good calibration, and transferability, with consistently good performance on multiple external datasets. Our results demonstrate the feasibility of a COVID-19 predictive model that delivers high accuracy without the need for complex feature engineering.
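The two metrics reported in the abstract can be illustrated with a minimal sketch. It uses the standard textbook definitions: AUROC as the probability that a randomly chosen positive case is scored above a randomly chosen negative case, and Harrell's C-index as the share of comparable patient pairs whose predicted risks are ordered consistently with their observed event times. This is not the authors' evaluation code. The function names `auroc` and `c_index` and all toy values are assumptions made up for illustration, and pairs with tied observed times are simply skipped here, which is a simplification.

```python
from itertools import combinations


def auroc(labels, scores):
    """AUROC as a pairwise ranking probability (ties count as 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def c_index(times, events, risks):
    """Harrell's concordance index for right-censored data.

    times  : observed follow-up time per patient
    events : 1 if the event was observed, 0 if the patient was censored
    risks  : predicted risk score (higher means earlier expected event)
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Let a be the patient with the shorter observed time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        # A pair is comparable only if the shorter time is an observed
        # event; pairs with tied times are skipped in this sketch.
        if times[a] == times[b] or not events[a]:
            continue
        comparable += 1
        if risks[a] > risks[b]:
            concordant += 1.0
        elif risks[a] == risks[b]:
            concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Hypothetical toy data, not taken from the study.
toy_labels = [1, 0, 1, 0, 0]
toy_scores = [0.9, 0.2, 0.6, 0.7, 0.1]
toy_times = [2, 5, 3, 8, 6]
toy_events = [1, 0, 1, 0, 1]

print(auroc(toy_labels, toy_scores))
print(c_index(toy_times, toy_events, toy_scores))
```

Both functions are quadratic in the number of patients, which is fine for illustration; a cohort of the size described in the abstract would normally be scored with an optimized library implementation instead.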

Publisher

Cold Spring Harbor Laboratory

References: 28 articles.

1. Coronavirus disease (COVID-19) – World Health Organization. https://www.who.int/emergencies/diseases/novel-coronavirus-2019 (accessed May 29, 2021).

2. CDC. COVID Data Tracker. 2020; published online March 28. https://covid.cdc.gov/covid-data-tracker (accessed March 28, 2021).

3. Prediction models for diagnosis and prognosis of covid-19: systematic review and critical appraisal

4. Prediction models for diagnosis and prognosis in Covid-19

5. Prediction models for COVID-19 clinical decision making; Lancet Digit Health, 2020

Cited by 2 articles.

1. A Transformer-Based Model Trained on Large Scale Claims Data for Prediction of Severe COVID-19 Disease Progression; 2022-11-30

2. AI-aided dynamic prediction of bleeding and ischemic risk after coronary stenting and subsequent DAPT; 2022-02-07