Abstract
The main purpose of this work is to study how loss functions in machine learning influence "binary machines", i.e., probabilistic AI models for binary classification problems. In particular, we show the following results: (i) different measures of accuracy, such as the area under the ROC curve (AUC), the maximal balanced accuracy, and the maximal weighted accuracy, are topologically equivalent, with natural inequalities relating them; (ii) the so-called real probability machines with respect to given information spaces are the optimal machines, i.e., they have the highest precision among all possible machines, and moreover their ROC curves are automatically convex; (iii) the cross-entropy and the square loss are the most natural loss functions in the sense that the real probability machine is their minimizer; (iv) an arbitrary strictly convex loss function also has an optimal machine as its minimizer, related to the real probability machine by a mere reparametrization of sigmoid values; however, if the loss function is not convex, then its minimizer is not an optimal machine, and strange phenomena may happen.
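Result (iii) can be illustrated numerically: for a fixed input whose true positive-label probability is p, the expected cross-entropy loss and the expected square loss over predicted values q are both minimized at q = p. The sketch below (an illustrative assumption, not code from the paper; the function names are hypothetical) minimizes both expected losses over a grid:

```python
import numpy as np

# For true positive-label probability p, the expected losses over a
# prediction q are
#   cross-entropy: L(q) = -p*log(q) - (1-p)*log(1-q)
#   square loss:   S(q) = p*(1-q)^2 + (1-p)*q^2
# Both are minimized at q = p, matching the "real probability machine".

def expected_cross_entropy(q, p):
    return -p * np.log(q) - (1 - p) * np.log(1 - q)

def expected_square_loss(q, p):
    return p * (1 - q) ** 2 + (1 - p) * q ** 2

p = 0.3
qs = np.linspace(0.01, 0.99, 9801)          # grid of candidate predictions
q_ce = qs[np.argmin(expected_cross_entropy(qs, p))]
q_sq = qs[np.argmin(expected_square_loss(qs, p))]
print(q_ce, q_sq)  # both grid minimizers sit at (or next to) p = 0.3
```

A strictly convex loss other than these two would, per result (iv), still recover p up to a reparametrization of the output scale.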
Subject
General Mathematics, Engineering (miscellaneous), Computer Science (miscellaneous)
Cited by 9 articles.