What do Black-box Machine Learning Prediction Models See?- An Application Study With Sepsis Detection

Author:

Strickler Ethan A. T.1,Thomas Joshua2,Thomas Johnson P.3,Benjamin Bruce3,Shamsuddin Rittika3

Affiliation:

1. East Central University

2. Rush University Medical Center

3. Oklahoma State University

Abstract

Abstract Purpose The purpose of this study is to identify additional clinical features for sepsis detection through the use of a novel mechanism for interpreting black-box machine learning models trained and to provide a suitable evaluation for the mechanism. Methods We use the publicly available dataset from the 2019 PhysioNet Challenge. It has around 40,000 Intensive Care Unit (ICU) patients with 40 physiological variables. Using Long Short-Term Memory (LSTM) as the representative black-box machine learning model, we adapted the Multi-set Classifier to globally interpret the black-box model for concepts it learned about sepsis. To identify relevant features, the result is compared against: i) features used by a computational sepsis expert, ii) clinical features from clinical collaborators, iii) academic features from literature, and iv) significant features from statistical hypothesis testing. Results Random Forest (RF) was found to be the computational sepsis expert because it had high accuracies for solving both the detection and early detection, and a high degree of overlap with clinical and literature features. Using the proposed interpretation mechanism and the dataset, we identified 17 features that the LSTM used for sepsis classification, 11 of which overlaps with the top 20 features from the RF model, 10 with academic features and 5 with clinical features. Clinical opinion suggests, 3 LSTM features have strong correlation with some clinical features that were not identified by the mechanism. We also found that age, chloride ion concentration, pH and oxygen saturation should be investigated further for connection with developing sepsis. Conclusion Interpretation mechanisms can bolster the incorporation of state-of-the-art machine learning models into clinical decision support systems, and might help clinicians to address the issue of early sepsis detection. The promising results from this study warrants further investigation into creation of new and improvement of existing interpretation mechanisms for black-box models, and into clinical features that are currently not used in clinical assessment of sepsis.

Publisher

Research Square Platform LLC

Reference75 articles.

1. Callahan A. (2017). Key Advances in Clinical Informatics. Chapter 19 Machine Learning in Healthcare. ISBN: 9780128095232

2. Machine learning applications in cancer prognosis and prediction, Computational and Structural Biotechnology Journal;Konstantina Kourou Themis P,2015

3. An Interpretable Machine Learning Model for Accurate Prediction of Sepsis in the ICU;Nemati S;Critical care medicine,2018

4. "Breast Cancer Histopathology ImageAnalysis: A Review,";Veta M;IEEE Transactions on Biomedical Engineering,2014

5. Charron, Martin; Beyer, Thomas; Bohnen, Nicholas N.; Kinahan, Paul E.; Dachille, Marsha; Jerin, Jeff; Nutt, Ronald; Meltzer, Carolyn Cidis; Villemagne, Victor; Townsend, David W. Image Analysis in Patients with Cancer Studied with a Combined PET and CT Scanner, Clinical Nuclear Medicine: November 2000 – Volume 25 - Issue 11 - p 905–910

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3