Improved patient mortality predictions in emergency departments with deep learning data-synthesis and ensemble models

Published: 2023-09-12 Issue: 1 Volume: 13
ISSN: 2045-2322
Container-title: Scientific Reports
Language: en
Short-container-title: Sci Rep

Author:

Son Byounghoon, Myung Jinwoo, Shin Younghwan, Kim Sangdo, Kim Sung Hyun, Chung Jong-Moon, Noh Jiyoung, Cho Junho, Chung Hyun Soo

Abstract

The triage process in emergency departments (EDs) relies on the subjective assessment of medical practitioners, making it unreliable in certain aspects. There is a need for a more accurate and objective algorithm to determine the urgency of patients. This paper explores the application of advanced data-synthesis algorithms, machine learning (ML) algorithms, and ensemble models to predict patient mortality. Patients predicted to be at risk of mortality are in a highly critical condition, signifying an urgent need for immediate medical intervention. This paper aims to determine the most effective method for predicting mortality by enhancing the F1 score while maintaining a high area under the receiver operating characteristic curve (AUC) score. This study used a dataset of 7325 patients who visited the ED of Yonsei Severance Hospital in Seoul, South Korea. The patients were divided into two groups: those who died in the ED and those who did not. Various data-synthesis techniques, such as SMOTE, ADASYN, CTGAN, TVAE, CopulaGAN, and Gaussian Copula, were deployed to generate synthetic patient data. Twenty-two ML models were then utilized, including tree-based algorithms such as Decision Tree, AdaBoost, LightGBM, CatBoost, XGBoost, and NGBoost; TabNet, a deep neural network algorithm; statistical algorithms such as Support Vector Machine, Logistic Regression, Random Forest, k-nearest neighbors, and Gaussian Naive Bayes; and ensemble models that combine the results of the ML models. Based on 21 patient information features used in the pandemic influenza triage algorithm (PITA), these models were applied to predict patient mortality. In evaluating ML algorithms on an imbalanced medical dataset, conventional metrics like accuracy or AUC can be misleading. This paper emphasizes the importance of using the F1 score as the primary performance measure, focusing on recall and specificity in detecting patient mortality. The highest-ranked model for predicting mortality utilized the Gaussian Copula data-synthesis technique and the CatBoost classifier, achieving an AUC of 0.9731 and an F1 score of 0.7059. These findings highlight the effectiveness of machine learning algorithms and data-synthesis techniques in improving the prediction performance of mortality in EDs.
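
The abstract's point that accuracy can mislead on an imbalanced dataset can be illustrated with a small sketch. The cohort below is hypothetical (1000 visits with 20 deaths) and is not the study's data; the two sets of predictions and the helper names `confusion_counts`, `accuracy`, and `f1_score` are assumptions made for illustration only.

```python
def confusion_counts(y_true, y_pred):
    """Return (tp, fp, fn, tn) for binary labels where 1 = mortality."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    return tp, fp, fn, tn


def accuracy(y_true, y_pred):
    """Fraction of all predictions that match the true label."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    return (tp + tn) / len(y_true)


def f1_score(y_true, y_pred):
    """Harmonic mean of precision and recall for the positive class."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical cohort (not the study's data): 20 deaths among 1000 visits.
y_true = [1] * 20 + [0] * 980

# Baseline that predicts "survives" for every patient.
always_survive = [0] * 1000

# Illustrative model: detects 12 of the 20 deaths and raises 5 false alarms.
some_model = [1] * 12 + [0] * 8 + [1] * 5 + [0] * 975

for name, pred in [("always_survive", always_survive), ("some_model", some_model)]:
    print(f"{name}: accuracy={accuracy(y_true, pred):.3f}, F1={f1_score(y_true, pred):.3f}")
```

In this sketch the two predictors differ by less than one percentage point in accuracy, yet the baseline never identifies a single death and so has an F1 score of zero. That gap is the reason a metric tied to the minority class is a more informative primary measure on data like this.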

Publisher

Springer Science and Business Media LLC

Subject

Multidisciplinary

Link

https://www.nature.com/articles/s41598-023-41544-0.pdf

Reference: 32 articles.

1. CDC. FastStats—Emergency department visits. https://www.cdc.gov/nchs/fastats/emergency-department.htm (2018).

2. Zachariasse, J. M. et al. Performance of triage systems in emergency care: A systematic review and meta-analysis. BMJ Open 9, e026471. https://doi.org/10.1136/bmjopen-2018-026471 (2019).

3. Qureshi, M. N. & AlRajhi, A. Challenge of Covid-19 crisis managed by emergency department of a big tertiary centre in Saudi Arabia. Int. J. Pediatr. Adolesc. Med. 7, 147–152. https://doi.org/10.1016/J.IJPAM.2020.08.001 (2020).

4. Morley, C., Unwin, M., Peterson, G. M., Stankovich, J. & Kinsman, L. Emergency department crowding: A systematic review of causes, consequences and solutions. PLoS ONE 13, e0203316. https://doi.org/10.1371/JOURNAL.PONE.0203316 (2018).

5. Truog, R. D., Mitchell, C. & Daley, G. Q. The toughest triage—Allocating ventilators in a pandemic. New Engl. J. Med. 382, 1973–1975. https://doi.org/10.1056/NEJMp2005689 (2020).

Cited by 4 articles.

1. Can supervised deep learning architecture outperform autoencoders in building propensity score models for matching?;BMC Medical Research Methodology;2024-08-02

2. Can I trust my fake data – A comprehensive quality assessment framework for synthetic tabular data in healthcare;International Journal of Medical Informatics;2024-05

3. Pseudo datasets explain artificial neural networks;International Journal of Data Science and Analytics;2024-04-10

4. A comparative study of explainable ensemble learning and logistic regression for predicting in-hospital mortality in the emergency department;Scientific Reports;2024-02-10