Using Item Response Theory for Explainable Machine Learning in Predicting Mortality in the Intensive Care Unit: Case-Based Approach-Reference-Cited by-同舟云学术

Using Item Response Theory for Explainable Machine Learning in Predicting Mortality in the Intensive Care Unit: Case-Based Approach

Published:2020-09-25 Issue:9 Volume:22 Page:e20268
ISSN:1438-8871
Container-title:Journal of Medical Internet Research
language:en
Short-container-title:J Med Internet Res

Author:

Kline Adrienne^ORCID,Kline Theresa^ORCID,Shakeri Hossein Abad Zahra^ORCID,Lee Joon^ORCID

Abstract

Background Supervised machine learning (ML) is being featured in the health care literature with study results frequently reported using metrics such as accuracy, sensitivity, specificity, recall, or F1 score. Although each metric provides a different perspective on the performance, they remain to be overall measures for the whole sample, discounting the uniqueness of each case or patient. Intuitively, we know that all cases are not equal, but the present evaluative approaches do not take case difficulty into account. Objective A more case-based, comprehensive approach is warranted to assess supervised ML outcomes and forms the rationale for this study. This study aims to demonstrate how the item response theory (IRT) can be used to stratify the data based on how difficult each case is to classify, independent of the outcome measure of interest (eg, accuracy). This stratification allows the evaluation of ML classifiers to take the form of a distribution rather than a single scalar value. Methods Two large, public intensive care unit data sets, Medical Information Mart for Intensive Care III and electronic intensive care unit, were used to showcase this method in predicting mortality. For each data set, a balanced sample (n=8078 and n=21,940, respectively) and an imbalanced sample (n=12,117 and n=32,910, respectively) were drawn. A 2-parameter logistic model was used to provide scores for each case. Several ML algorithms were used in the demonstration to classify cases based on their health-related features: logistic regression, linear discriminant analysis, K-nearest neighbors, decision tree, naive Bayes, and a neural network. Generalized linear mixed model analyses were used to assess the effects of case difficulty strata, ML algorithm, and the interaction between them in predicting accuracy. Results The results showed significant effects (P<.001) for case difficulty strata, ML algorithm, and their interaction in predicting accuracy and illustrated that all classifiers performed better with easier-to-classify cases and that overall the neural network performed best. Significant interactions suggest that cases that fall in the most arduous strata should be handled by logistic regression, linear discriminant analysis, decision tree, or neural network but not by naive Bayes or K-nearest neighbors. Conventional metrics for ML classification have been reported for methodological comparison. Conclusions This demonstration shows that using the IRT is a viable method for understanding the data that are provided to ML algorithms, independent of outcome measures, and highlights how well classifiers differentiate cases of varying difficulty. This method explains which features are indicative of healthy states and why. It enables end users to tailor the classifier that is appropriate to the difficulty level of the patient for personalized medicine.

Publisher

JMIR Publications Inc.

Subject

Health Informatics

Reference55 articles.

1. Collaborative Filtering

2. Building an Evaluation Scale using Item Response Theory

3. Integrating machine learning into item response theory for addressing the cold start problem in adaptive learning systems

4. Item response theory in AI: Analysing machine learning classifiers at the instance level

5. Novel Feature Selection for Artificial Intelligence Using Item Response Theory for Mortality Prediction

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Development of short forms for screening children’s dental caries and urgent treatment needs using item response theory and machine learning methods;PLOS ONE;2024-03-22

2. How can organizations measure the integration of environmental, social, and governance (ESG) criteria? Validation of an instrument using item response theory to capture workers' perception;Business Strategy and the Environment;2024-01-12

3. Explainable artificial intelligence in information systems: A review of the status quo and future research directions;Electronic Markets;2023-05-27

4. Clinical applications of artificial intelligence and machine learning in the modern cardiac intensive care unit;Intelligence-Based Medicine;2023

5. A new modification and application of item response theory‐based feature selection for different machine learning tasks;Concurrency and Computation: Practice and Experience;2022-08-15