Abstract
Background
Machine learning (ML) holds the promise of becoming an essential tool for utilising the growing amount of clinical data available for analysis and clinical decision support. However, a lack of trust in these models has limited the acceptance of this technology in healthcare. This mistrust is often attributed to insufficient model explainability and interpretability, meaning that the relationship between model inputs and outputs is unclear. Improving trust therefore requires the development of more transparent ML methods.
Methods
In this paper, we use the publicly available eICU database to construct four ML models and then examine their internal behaviour with SHapley Additive exPlanations (SHAP) values. The models, based on random forest, logistic regression, naive Bayes, and adaptive boosting algorithms, predicted hospital mortality in ICU patients using a selection of the same features used to calculate the APACHE IV score.
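As an illustration only (the paper does not publish its code), the sketch below sets up the same four model families with scikit-learn and the shap library. The synthetic data, train/test split, explainer choice (the model-agnostic KernelExplainer), and all hyperparameters are assumptions standing in for the study's eICU-derived APACHE IV features.

```python
# Sketch: train the four model types and compute SHAP values for each.
# The real study uses APACHE IV-style features and hospital-mortality labels
# from the eICU database (credentialed access); a synthetic stand-in is used
# here so the example runs end to end.
import shap
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB

X, y = make_classification(n_samples=2000, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0)

models = {
    "random_forest": RandomForestClassifier(n_estimators=500, random_state=0),
    "logistic_regression": LogisticRegression(max_iter=1000),
    "naive_bayes": GaussianNB(),
    "adaboost": AdaBoostClassifier(random_state=0),
}

shap_values = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    # KernelExplainer is model-agnostic; the tree-based models could instead
    # use the faster TreeExplainer. A small background sample keeps the
    # kernel estimation tractable.
    background = shap.sample(X_train, 100)
    explainer = shap.KernelExplainer(
        lambda data, m=model: m.predict_proba(data)[:, 1], background)
    shap_values[name] = explainer.shap_values(X_test[:200])
```

For each model, a call such as shap.summary_plot(shap_values[name], X_test[:200]) would then visualise per-feature impact, which is the kind of output used to compare feature importance and direction of effect across algorithms.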
Results
The results showed that the models had similar discriminative abilities and largely agreed on feature importance, while calibration and the impact of individual features differed considerably and, in several cases, did not correspond to common medical theory.
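To make "discriminative ability" and "calibration" concrete, the following minimal sketch continues the example above; the specific metric choices (AUROC and a 10-bin reliability curve) are assumptions, not necessarily the paper's exact evaluation.

```python
# Sketch (continuing the example above): compare discrimination and calibration.
from sklearn.calibration import calibration_curve
from sklearn.metrics import roc_auc_score

for name, model in models.items():
    p = model.predict_proba(X_test)[:, 1]
    auc = roc_auc_score(y_test, p)  # discrimination: area under the ROC curve
    # calibration: observed event rate vs. mean predicted probability per bin
    frac_pos, mean_pred = calibration_curve(y_test, p, n_bins=10)
    print(f"{name}: AUROC={auc:.3f}")
```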
Conclusions
We already know that ML models treat data differently depending on the underlying algorithm. Our comparative analysis visualises the implications of these differences and their importance in a healthcare setting. SHAP value analysis is a promising method for incorporating explainability into model development and use, and may yield better and more trustworthy ML models in the future.
Publisher
Springer Science and Business Media LLC
Subject
Health Informatics, Epidemiology
Cited by
29 articles.