Affiliation:
1. Northwestern University Feinberg School of Medicine, Northwestern University, Chicago, Illinois, USA
2. Virginia Commonwealth School of Medicine, Virginia Commonwealth University, Richmond, Virginia, USA
Abstract
Background and Aims: Machine-learning techniques have seen increasing use across all fields. To accurately evaluate the efficacy of novel modeling methods, it is necessary to critically evaluate the reported model metrics, such as sensitivity, specificity, and area under the receiver operating characteristic curve (AUROC). For commonly used model metrics, we proposed the use of analytically derived distributions (ADDs) and compared them with simulation-based approaches.
Methods: A retrospective cohort study was conducted using the England National Health Services Heart Disease Prediction Cohort. Four machine-learning models (XGBoost, Random Forest, Artificial Neural Network, and Adaptive Boost) were used. The distributions of the model metrics and covariate gain statistics were empirically derived using bootstrap simulation (N = 10,000). The ADDs were created from analytic formulas over the covariates to describe the distributions of the model metrics and were compared with those from bootstrap simulation.
Results: XGBoost was the optimal model, having the highest AUROC and the highest aggregate score across six other model metrics. Based on the Anderson–Darling test, the distributions of the model metrics created from bootstrap did not deviate significantly from a normal distribution. The variance derived from the ADD led to smaller SDs than those derived from bootstrap simulation, whereas the rest of the distribution was not statistically significantly different.
Conclusions: ADDs allow for cross-study comparison of model metrics, which is usually done with bootstrapping; bootstrapping relies on simulations that cannot be replicated by the reader.
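To illustrate the kind of comparison the abstract describes, the sketch below bootstraps the AUROC of a model's predictions, tests the bootstrap distribution for normality with the Anderson–Darling test, and computes one well-known analytically derived standard error for AUROC (the Hanley–McNeil 1982 formula) for comparison. This is a minimal sketch on synthetic data: the labels, scores, and the choice of the Hanley–McNeil formula are illustrative assumptions, not the paper's own ADD formulas, which are not reproduced here.

    import numpy as np
    from scipy.stats import anderson
    from sklearn.metrics import roc_auc_score

    rng = np.random.default_rng(0)

    # Synthetic labels/scores standing in for a fitted model's predictions.
    y_true = rng.integers(0, 2, size=500)
    y_score = np.clip(y_true * 0.3 + rng.normal(0.5, 0.25, size=500), 0, 1)

    # Empirical distribution of AUROC via bootstrap resampling
    # (the paper used N = 10,000 resamples).
    n_boot = 10_000
    aucs = np.empty(n_boot)
    for i in range(n_boot):
        idx = rng.integers(0, len(y_true), size=len(y_true))
        while y_true[idx].min() == y_true[idx].max():  # resample until both classes present
            idx = rng.integers(0, len(y_true), size=len(y_true))
        aucs[i] = roc_auc_score(y_true[idx], y_score[idx])

    # Anderson–Darling test for normality of the bootstrap distribution.
    ad = anderson(aucs, dist="norm")
    print(f"bootstrap AUROC: mean={aucs.mean():.3f}, SD={aucs.std(ddof=1):.4f}")
    print(f"Anderson–Darling statistic: {ad.statistic:.3f} "
          f"(5% critical value {ad.critical_values[2]:.3f})")

    # One analytically derived SD for comparison: the Hanley–McNeil (1982)
    # formula for the standard error of AUROC.
    auc = roc_auc_score(y_true, y_score)
    n1, n0 = (y_true == 1).sum(), (y_true == 0).sum()
    q1 = auc / (2 - auc)
    q2 = 2 * auc**2 / (1 + auc)
    se = np.sqrt((auc * (1 - auc) + (n1 - 1) * (q1 - auc**2)
                  + (n0 - 1) * (q2 - auc**2)) / (n1 * n0))
    print(f"analytic (Hanley–McNeil) SE: {se:.4f}")

Because the analytic SE depends only on the observed AUROC and the class counts, a reader can recompute it exactly from reported numbers, whereas reproducing a bootstrap SD requires rerunning the simulation; this is the reproducibility advantage the conclusions point to.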
Cited by
11 articles.