Author:
Karamo Bah,Amadou Wurry Jallow,Adama Ns Bah,Musa Touray
Abstract
Background and aim: Congestive heart failure is a prevalent and serious condition that poses significant challenges in the emergency department setting. Prompt and accurate management of congestive heart failure patients is crucial for improving outcomes and optimizing resource utilization. This study aims to address these challenges by developing a machine learning algorithm and comparing it to a traditional logistic regression model that can assist in the triage, resource allocation, and long-term prognostication of congestive heart failure patients. Methods: In this investigation, we used the MIMIC-III database, a publicly accessible resource containing patient data from ICU settings. Traditional logistic regression, along with the robust XGBoost and random forest algorithms, was harnessed to construct predictive models. These models were built using a range of pretreatment clinical variables. To pinpoint the most pertinent features, we carried out a univariate analysis. Ensuring robust performance and broad applicability, we adopted a nested cross-validation approach. This method enhances the precision and validation of our models by implementing multiple cross-validation iterations. Results: The performance of machine learning algorithms was assessed using the area under the receiver operating characteristic curve (AUC). Notably, the random forest algorithm, despite having lower performance among the machine learning models still demonstrated significantly higher AUC than traditional logistic regression. The AUC for the XGBoost was 0.99, random forest 0.98, while traditional logistic regression was 0.57. The most important pretreatment variables associated with congestive heart failure include total bilirubin, creatine kinase, international normalized ratio (INR), sodium, age, creatinine, potassium, gender, alkaline phosphatase, and platelets. Conclusion: Machine learning techniques utilizing multiple pretreatment clinical variables outperform traditional logistic regression in aiding the triage, resource allocation, and long-term prognostication of congestive heart failure patients in the intensive care unit setting using MIMIC III data.
Publisher
Heighten Science Publications Corporation
Subject
Computer Science Applications,History,Education
Reference45 articles.
1. 1. Benjamin EJ, Blaha MJ, Chiuve SE, Cushman M, Das SR, Deo R, de Ferranti SD, Floyd J, Fornage M, Gillespie C, Isasi CR, Jiménez MC, Jordan LC, Judd SE, Lackland D, Lichtman JH, Lisabeth L, Liu S, Longenecker CT, et al. American Heart Association Statistics Committee and Stroke Statistics Subcommittee. Heart Disease and Stroke Statistics-2017 Update: A Report From the American Heart Association. Circulation. 2017 Mar 7;135(10):e146-e603. doi: 10.1161/CIR.0000000000000485. Epub 2017 Jan 25. Erratum in: Circulation. 2017 Mar 7;135(10 ):e646. Erratum in: Circulation. 2017 Sep 5;136(10 ):e196. PMID: 28122885; PMCID: PMC5408160.
2. 2. Braunwald E, Bristow MR. Congestive heart failure: fifty years of progress. Circulation. 2000 Nov 14;102(20 Suppl 4):IV14-23. doi: 10.1161/01.cir.102.suppl_4.iv-14. PMID: 11080127.
3. 3. Averbuch T, Lee SF, Mamas MA, Oz UE, Perez R, Connolly SJ, Ko DT, Van Spall HGC. Derivation and validation of a two-variable index to predict 30-day outcomes following heart failure hospitalization. ESC Heart Fail. 2021 Aug;8(4):2690-2697. doi: 10.1002/ehf2.13324. Epub 2021 May 1. PMID: 33932113; PMCID: PMC8318488.
4. 4. Averbuch T, Sullivan K, Sauer A, Mamas MA, Voors AA, Gale CP, Metra M, Ravindra N, Van Spall HGC. Applications of artificial intelligence and machine learning in heart failure. Eur Heart J Digit Health. 2022 May 13;3(2):311-322. doi: 10.1093/ehjdh/ztac025. PMID: 36713018; PMCID: PMC9707916.
5. 5. Cook JA, Collins GS. The rise of big clinical databases. Br J Surg. 2015 Jan;102(2):e93-e101. doi: 10.1002/bjs.9723. PMID: 25627139.