Affiliation:
1. Northwestern University
2. Cornell University
Abstract
Abstract
Importance:
Machine learning methods are widely used within the medical field. However, the reliability and efficacy of these models is difficult to assess. We assessed whether variance calculations of model metrics (e.g., AUROC, Sensitivity, Specificity) through bootstrap simulation and SHapely Additive exPlanations (SHAP) could increase model transparency.
Methods
Data from the England National Health Services Heart Disease Prediction Cohort was used. XGBoost was used as the machine-learning model of choice in this study. Boost-strap simulation (N = 10,000) was used to empirically derive the distribution of model metrics and covariate Gain statistics. SHapely Additive exPlanations (SHAP) to provide explanations to machine-learning output and simulation to evaluate the variance of model accuracy metrics.
Result
Among 10,000 simulations completed, we observed that the AUROC ranged from 0.771 to 0.947, a difference of 0.176, the balanced accuracy ranged from 0.688 to 0.894, a 0.205 difference, the sensitivity ranged from 0.632 to 0.939, a 0.307 difference, and the specificity ranged from 0.595 to 0.944, a 0.394 difference. Among 10,000 simulations completed, we observed that the gain for Angina ranged from 0.225 to 0.456, a difference of 0.231, for Cholesterol ranged from 0.148 to 0.326, a difference of 0.178, the MaxHR ranged from 0.081 to 0.200, a range of 0.119, and for Age ranged from 0.059 to 0.157, difference of 0.098.
Conclusion
Use of simulations to empirically evaluate the variance of model metrics and explanatory algorithms to observe if covariates match the literature are necessary for increased transparency, reliability, and utility of machine learning methods.
Publisher
Research Square Platform LLC
Reference32 articles.
1. Heo J, Yoon JG, Park H, Kim YD, Nam HS, Heo JH. Machine Learning-Based Model for Prediction of Outcomes in Acute Stroke. Stroke. 2019 May;50(5):1263–1265. doi: 10.1161/STROKEAHA.118.024293. PMID: 30890116.
2. Machine Learning and Deep Learning Approaches in Breast Cancer Survival Prediction Using Clinical Data;Kalafi EY;Folia Biol (Praha),2019
3. Machine learning model for early prediction of acute kidney injury (AKI) in pediatric critical care;Dong J;Crit Care. 2021 Aug
4. Wang Z, Li H, Carpenter C, Guan Y. Challenge-Enabled Machine Learning to Drug-Response Prediction. AAPS J. 2020 Aug 10;22(5):106. doi: 10.1208/s12248-020-00494-5. PMID: 32778984.
5. Machine learning in the prediction of depression treatment outcomes: a systematic review and meta-analysis;Sajjadian M;Psychol Med,2021