Deconstructing Cross-Entropy for Probabilistic Binary Classifiers-Reference-Cited by-同舟云学术

Deconstructing Cross-Entropy for Probabilistic Binary Classifiers

Published:2018-03-20 Issue:3 Volume:20 Page:208
ISSN:1099-4300
Container-title:Entropy
language:en
Short-container-title:Entropy

Author:

Ramos Daniel^ORCID,Franco-Pedroso Javier,Lozano-Diez Alicia^ORCID,Gonzalez-Rodriguez Joaquin^ORCID

Abstract

In this work, we analyze the cross-entropy function, widely used in classifiers both as a performance measure and as an optimization objective. We contextualize cross-entropy in the light of Bayesian decision theory, the formal probabilistic framework for making decisions, and we thoroughly analyze its motivation, meaning and interpretation from an information-theoretical point of view. In this sense, this article presents several contributions: First, we explicitly analyze the contribution to cross-entropy of (i) prior knowledge; and (ii) the value of the features in the form of a likelihood ratio. Second, we introduce a decomposition of cross-entropy into two components: discrimination and calibration. This decomposition enables the measurement of different performance aspects of a classifier in a more precise way; and justifies previously reported strategies to obtain reliable probabilities by means of the calibration of the output of a discriminating classifier. Third, we give different information-theoretical interpretations of cross-entropy, which can be useful in different application scenarios, and which are related to the concept of reference probabilities. Fourth, we present an analysis tool, the Empirical Cross-Entropy (ECE) plot, a compact representation of cross-entropy and its aforementioned decomposition. We show the power of ECE plots, as compared to other classical performance representations, in two diverse experimental examples: a speaker verification system, and a forensic case where some glass findings are present.

Funder

Spanish Ministry of Economy and Competitiveness

Publisher

MDPI AG

Subject

General Physics and Astronomy

Link

https://www.mdpi.com/1099-4300/20/3/208/pdf

Reference45 articles.

1. Machine Learning: A Probabilistic Perspective;Murphy,2012

2. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods;Platt,1999

3. Properties and benefits of calibrated classifiers;Cohen,2004

Cited by 72 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Evaluating outlier probabilities: assessing sharpness, refinement, and calibration using stratified and weighted measures;Data Mining and Knowledge Discovery;2024-07-19

2. Class imbalance on medical image classification: towards better evaluation practices for discrimination and calibration performance;European Radiology;2024-06-11

3. Harmonizing Sounds: A Comprehensive Approach to Automated Music Transcription and Vocal Isolation;2024 International Conference on Innovations and Challenges in Emerging Technologies (ICICET);2024-06-07

4. Artificial neural networks analysis predicts long-term fistula function in hemodialysis patients following percutaneous transluminal angioplasty;EngMedicine;2024-06

5. Machine learning screening tools for the prediction of extraction yields of pharmaceutical compounds from wastewaters;Journal of Water Process Engineering;2024-05