Framework for Testing Robustness of Machine Learning-Based Classifiers

Published:2022-08-14 Issue:8 Volume:12 Page:1314
ISSN:2075-4426
Container-title:Journal of Personalized Medicine
language:en
Short-container-title:JPM

Author:

Chuah Joshua, Kruger Uwe, Wang Ge, Yan Pingkun, Hahn Juergen

Abstract

There has been a rapid increase in the number of artificial intelligence (AI)/machine learning (ML)-based biomarker diagnostic classifiers in recent years. However, relatively little work has focused on assessing the robustness of these biomarkers, i.e., investigating the uncertainty of the AI/ML models that these biomarkers are based upon. This paper addresses this issue by proposing a framework to evaluate already-developed classifiers with regard to their robustness by focusing on the variability of the classifiers’ performance and changes in the classifiers’ parameter values using factor analysis and Monte Carlo simulations. Specifically, this work evaluates (1) the importance of a classifier’s input features and (2) the variability of a classifier’s output and model parameter values in response to data perturbations. Additionally, it was found that one can estimate a priori how much replacement noise a classifier can tolerate while still meeting accuracy goals. To illustrate the evaluation framework, six different AI/ML-based biomarkers are developed using commonly used techniques (linear discriminant analysis, support vector machines, random forest, partial least squares discriminant analysis, logistic regression, and multilayer perceptron) for a metabolomics dataset involving 24 measured metabolites taken from 159 study participants. The framework was able to correctly predict which of the classifiers should be less robust than others without recomputing the classifiers themselves, and this prediction was then validated in a detailed analysis.

Funder

National Institutes of Health

National Institute on Aging

Publisher

MDPI AG

Subject

Medicine (miscellaneous)

Link

https://www.mdpi.com/2075-4426/12/8/1314/pdf

Cited by 5 articles.

1. Exploring the potential of routine serological markers in predicting neurological outcomes in spinal cord injury;Experimental Neurology;2024-10

2. Identification of high-risk population of pneumoconiosis using deep learning segmentation of lung 3D images and radiomics texture analysis;Computer Methods and Programs in Biomedicine;2024-02

3. Ensemble Reinforcement Learning in Collision Avoidance to Enhance Decision-Making Reliability;2023 7th International Conference on System Reliability and Safety (ICSRS);2023-11-22

4. Risk factors for high CAD-RADS scoring in CAD patients revealed by machine learning methods: a retrospective study;PeerJ;2023-08-03

5. Performance improvement and complexity reduction in the classification of EMG signals with mRMR-based CNN-KNN combined model;Journal of Intelligent & Fuzzy Systems;2023-01-30