Semisupervised Deep Learning Techniques for Predicting Acute Respiratory Distress Syndrome From Time-Series Clinical Data: Model Development and Validation Study-Reference-Cited by-同舟云学术

Semisupervised Deep Learning Techniques for Predicting Acute Respiratory Distress Syndrome From Time-Series Clinical Data: Model Development and Validation Study

Published:2021-09-14 Issue:9 Volume:5 Page:e28028
ISSN:2561-326X
Container-title:JMIR Formative Research
language:en
Short-container-title:JMIR Form Res

Author:

Lam Carson^ORCID,Tso Chak Foon^ORCID,Green-Saxena Abigail^ORCID,Pellegrini Emily^ORCID,Iqbal Zohora^ORCID,Evans Daniel^ORCID,Hoffman Jana^ORCID,Calvert Jacob^ORCID,Mao Qingqing^ORCID,Das Ritankar^ORCID

Abstract

Background A high number of patients who are hospitalized with COVID-19 develop acute respiratory distress syndrome (ARDS). Objective In response to the need for clinical decision support tools to help manage the next pandemic during the early stages (ie, when limited labeled data are present), we developed machine learning algorithms that use semisupervised learning (SSL) techniques to predict ARDS development in general and COVID-19 populations based on limited labeled data. Methods SSL techniques were applied to 29,127 encounters with patients who were admitted to 7 US hospitals from May 1, 2019, to May 1, 2021. A recurrent neural network that used a time series of electronic health record data was applied to data that were collected when a patient’s peripheral oxygen saturation level fell below the normal range (<97%) to predict the subsequent development of ARDS during the remaining duration of patients’ hospital stay. Model performance was assessed with the area under the receiver operating characteristic curve and area under the precision recall curve of an external hold-out test set. Results For the whole data set, the median time between the first peripheral oxygen saturation measurement of <97% and subsequent respiratory failure was 21 hours. The area under the receiver operating characteristic curve for predicting subsequent ARDS development was 0.73 when the model was trained on a labeled data set of 6930 patients, 0.78 when the model was trained on the labeled data set that had been augmented with the unlabeled data set of 16,173 patients by using SSL techniques, and 0.84 when the model was trained on the entire training set of 23,103 labeled patients. Conclusions In the context of using time-series inpatient data and a careful model training design, unlabeled data can be used to improve the performance of machine learning models when labeled data for predicting ARDS development are scarce or expensive.

Publisher

JMIR Publications Inc.

Subject

Computer Science Applications,Health Informatics,Medicine (miscellaneous)

Reference33 articles.

1. Incidence and Outcomes of Acute Lung Injury

2. Epidemiology, Patterns of Care, and Mortality for Patients With Acute Respiratory Distress Syndrome in Intensive Care Units in 50 Countries

3. The acute respiratory distress syndrome

4. Acute respiratory distress syndrome: Underrecognition by clinicians

5. Embracing the Heterogeneity of ARDS

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Machine Learning Tools for Acute Respiratory Distress Syndrome Detection and Prediction;Critical Care Medicine;2024-08-12

2. Automatic ARDS surveillance with chest X-ray recognition using convolutional neural networks;Journal of Critical Care;2024-08

3. A systematic review of machine learning models for management, prediction and classification of ARDS;Respiratory Research;2024-06-04

4. A Novel Method for Medical Predictive Models in Small Data Using Out-of-Distribution Data and Transfer Learning;Mathematics;2024-01-11

5. Lung Imaging and Artificial Intelligence in ARDS;Journal of Clinical Medicine;2024-01-05