Development and validation of an interpretable machine learning for mortality prediction in patients with sepsis-Reference-Cited by-同舟云学术

Development and validation of an interpretable machine learning for mortality prediction in patients with sepsis

Published:2024-07-08 Issue: Volume:7 Page:
ISSN:2624-8212
Container-title:Frontiers in Artificial Intelligence
language:
Short-container-title:Front. Artif. Intell.

Author:

He Bihua,Qiu Zheng

Abstract

IntroductionSepsis is a leading cause of death. However, there is a lack of useful model to predict outcome in sepsis. Herein, the aim of this study was to develop an explainable machine learning (ML) model for predicting 28-day mortality in patients with sepsis based on Sepsis 3.0 criteria.MethodsWe obtained the data from the Medical Information Mart for Intensive Care (MIMIC)-III database (version 1.4). The overall data was randomly assigned to the training and testing sets at a ratio of 3:1. Following the application of LASSO regression analysis to identify the modeling variables, we proceeded to develop models using Extreme Gradient Boost (XGBoost), Logistic Regression (LR), Support Vector Machine (SVM), and Random Forest (RF) techniques with 5-fold cross-validation. The optimal model was selected based on its area under the curve (AUC). Finally, the Shapley additive explanations (SHAP) method was used to interpret the optimal model.ResultsA total of 5,834 septic adults were enrolled, the median age was 66 years (IQR, 54–78 years) and 2,342 (40.1%) were women. After feature selection, 14 variables were included for developing model in the training set. The XGBoost model (AUC: 0.806) showed superior performance with AUC, compared with RF (AUC: 0.794), LR (AUC: 0.782) and SVM model (AUC: 0.687). SHAP summary analysis for XGBoost model showed that urine output on day 1, age, blood urea nitrogen and body mass index were the top four contributors. SHAP dependence analysis demonstrated insightful nonlinear interactive associations between factors and outcome. SHAP force analysis provided three samples for model prediction.ConclusionIn conclusion, our study successfully demonstrated the efficacy of ML models in predicting 28-day mortality in sepsis patients, while highlighting the potential of the SHAP method to enhance model transparency and aid in clinical decision-making.

Publisher

Frontiers Media SA

Reference42 articles.

1. Opening the black box: interpretable machine learning for geneticists;Azodi;Trends Genet.,2020

2. Lasso adjustments of treatment effect estimates in randomized experiments;Bloniarz;Proc. Natl. Acad. Sci. USA,2016

3. Elevated body mass index is associated with an increased risk of infectious disease admissions and mortality: a mendelian randomization study;Butler-Laporte;Clin. Microbiol. Infect.,2020

4. Efficient statistical tests to compare Youden index: accounting for contingency correlation;Chen;Stat. Med.,2015

5. Using explainable machine learning to identify patients at risk of reattendance at discharge from emergency departments;Chmiel;Sci. Rep.,2021