A Probability-Based Models Ranking Approach: An Alternative Method of Machine-Learning Model Performance Assessment

Published: 2022-08-24 Volume: 22 Issue: 17 Page: 6361
ISSN: 1424-8220
Journal: Sensors
Language: en

Author:

Gajda Stanisław, Chlebus Marcin

Abstract

Performance measures are crucial in selecting the best machine learning model for a given problem. Estimating classical model performance measures by subsampling methods like bagging or cross-validation has several weaknesses. The most important are the inability to test the significance of the difference between models and the lack of interpretability. The recently proposed Elo-based Predictive Power (EPP), a meta-measure of machine learning model performance, is an attempt to address these weaknesses. However, the EPP is based on wrong assumptions, so its estimates may not be correct. This paper introduces the Probability-based Ranking Model Approach (PMRA), a modified EPP approach with a correction that makes its estimates more reliable. PMRA is based on calculating the probability that one model achieves a better result than another, using a Mixed Effects Logistic Regression model. The empirical analysis was carried out on a real mortgage credit dataset. It included a comparison of how PMRA and state-of-the-art k-fold cross-validation ranked 49 machine learning models, an example application of the novel method to a hyperparameter tuning problem, and a comparison of PMRA and EPP indications. PMRA makes it possible to compare a newly developed algorithm with state-of-the-art algorithms based on statistical criteria. It also offers a way to select the best hyperparameter configuration and to formulate criteria for continuing the hyperparameter space search.
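
The central quantity named in the abstract, the probability that one model achieves a better result than another, can be illustrated with a minimal sketch. This is not the paper's estimator: PMRA fits a Mixed Effects Logistic Regression, whereas the code below only computes the raw empirical win fraction over shared cross-validation folds. The function names, the tie rule (a tie counts as half a win), and the AUC values are all hypothetical and chosen for illustration.

```python
from itertools import permutations


def pairwise_win_probability(scores):
    """Empirical probability that model a beats model b on a shared fold.

    scores: dict mapping model name -> list of per-fold scores
            (higher is better), aligned on the same folds.
    Returns a dict mapping (a, b) -> fraction of folds where a beats b.
    Ties count as half a win (an assumption of this sketch).
    """
    names = list(scores)
    n_folds = len(scores[names[0]])
    if n_folds == 0 or any(len(v) != n_folds for v in scores.values()):
        raise ValueError("all models need the same non-zero number of folds")

    probs = {}
    for a, b in permutations(names, 2):
        wins = sum(
            1.0 if x > y else 0.5 if x == y else 0.0
            for x, y in zip(scores[a], scores[b])
        )
        probs[(a, b)] = wins / n_folds
    return probs


def rank_by_mean_win_probability(scores):
    """Rank models by their average win probability against all others."""
    probs = pairwise_win_probability(scores)
    names = list(scores)
    mean_win = {
        a: sum(probs[(a, b)] for b in names if b != a) / (len(names) - 1)
        for a in names
    }
    return sorted(mean_win.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical per-fold AUC values for three models on five shared folds.
fold_auc = {
    "gbm": [0.81, 0.79, 0.83, 0.80, 0.82],
    "rf": [0.80, 0.78, 0.84, 0.79, 0.81],
    "logit": [0.75, 0.76, 0.74, 0.77, 0.75],
}

win_probs = pairwise_win_probability(fold_auc)
ranking = rank_by_mean_win_probability(fold_auc)
for name, p in ranking:
    print(f"{name}: mean win probability {p:.2f}")
```

A raw win fraction like this carries no uncertainty estimate and ignores fold-level effects; the abstract's model-based approach is what allows comparisons based on statistical criteria.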

Funder

Ministry of Education

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/22/17/6361/pdf

References (44 articles)

1. A study of statistical techniques and performance measures for genetics-based machine learning: accuracy and interpretability

2. Deep Learning in Double Dummy Bridge Problem; Kowalik; Ph.D. Thesis, 2021

3. A comparative study of breast cancer tumor classification by classical machine learning methods and deep learning method

4. Application of the cross entropy method to the GLVQ algorithm

5. A Deep Learning Mammography-based Model for Improved Breast Cancer Risk Prediction

Cited by 3 articles.

1. Deep-PK: deep learning for small molecule pharmacokinetic and toxicity prediction; Nucleic Acids Research; 2024-04-18

2. Snow avalanche susceptibility mapping from tree-based machine learning approaches in ungauged or poorly-gauged regions; CATENA; 2023-05

3. Applications of machine learning in metabolomics: Disease modeling and classification; Frontiers in Genetics; 2022-11-24