Identifying Predictors Associated with Risk of Death or Admission to Intensive Care Unit in Internal Medicine Patients with Sepsis: A Comparison of Statistical Models and Machine Learning Algorithms

Author:

Mirijello Antonio1ORCID,Fontana Andrea2ORCID,Greco Antonio Pio1ORCID,Tosoni Alberto3ORCID,D’Agruma Angelo1ORCID,Labonia Maria4,Copetti Massimiliano2,Piscitelli Pamela1,De Cosmo Salvatore1,

Affiliation:

1. Department of Medical Sciences, Fondazione IRCCS Casa Sollievo della Sofferenza, 71013 San Giovanni Rotondo, Italy

2. Unit of Biostatistics, Fondazione IRCCS Casa Sollievo della Sofferenza, 71013 San Giovanni Rotondo, Italy

3. Department of Internal Medicine and Gastroenterology, Fondazione Policlinico Universitario “A. Gemelli” IRCCS, 00168 Rome, Italy

4. Unit of Microbiology, Fondazione IRCCS Casa Sollievo della Sofferenza, 71013 San Giovanni Rotondo, Italy

Abstract

Background: Sepsis is a time-dependent disease: the early recognition of patients at risk for poor outcome is mandatory. Aim: To identify prognostic predictors of the risk of death or admission to intensive care units in a consecutive sample of septic patients, comparing different statistical models and machine learning algorithms. Methods: Retrospective study including 148 patients discharged from an Italian internal medicine unit with a diagnosis of sepsis/septic shock and microbiological identification. Results: Of the total, 37 (25.0%) patients reached the composite outcome. The sequential organ failure assessment (SOFA) score at admission (odds ratio (OR): 1.83; 95% confidence interval (CI): 1.41–2.39; p < 0.001), delta SOFA (OR: 1.64; 95% CI: 1.28–2.10; p < 0.001), and the alert, verbal, pain, unresponsive (AVPU) status (OR: 5.96; 95% CI: 2.13–16.67; p < 0.001) were identified through the multivariable logistic model as independent predictors of the composite outcome. The area under the receiver operating characteristic curve (AUC) was 0.894; 95% CI: 0.840–0.948. In addition, different statistical models and machine learning algorithms identified further predictive variables: delta quick-SOFA, delta-procalcitonin, mortality in emergency department sepsis, mean arterial pressure, and the Glasgow Coma Scale. The cross-validated multivariable logistic model with the least absolute shrinkage and selection operator (LASSO) penalty identified 5 predictors; and recursive partitioning and regression tree (RPART) identified 4 predictors with higher AUC (0.915 and 0.917, respectively); the random forest (RF) approach, including all evaluated variables, obtained the highest AUC (0.978). All models’ results were well calibrated. Conclusions: Although structurally different, each model identified similar predictive covariates. The classical multivariable logistic regression model was the most parsimonious and calibrated one, while RPART was the easiest to interpret clinically. Finally, LASSO and RF were the costliest in terms of number of variables identified.

Publisher

MDPI AG

Subject

Pharmacology (medical),Infectious Diseases,Microbiology (medical),General Pharmacology, Toxicology and Pharmaceutics,Biochemistry,Microbiology

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3