Performance Evaluation in Machine Learning: The Good, the Bad, the Ugly, and the Way Forward

Published:2019-07-17
Volume:33
Page:9808-9814
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title:AAAI

Author:

Flach Peter

Abstract

This paper gives an overview of some ways in which our understanding of performance evaluation measures for machine-learned classifiers has improved over the last twenty years. I also highlight a range of areas where this understanding is still lacking, leading to ill-advised practices in classifier evaluation. This suggests that in order to make further progress we need to develop a proper measurement theory of machine learning. I then demonstrate by example what such a measurement theory might look like and what kinds of new results it would entail. Finally, I argue that key properties such as classification ability and data set difficulty are unlikely to be directly observable, suggesting the need for latent-variable models and causal inference.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 57 articles.

1. Predicting Steady-State Metabolic Power in Cerebral Palsy, Stroke, and the Elderly During Walking With and Without Assistive Devices;Annals of Biomedical Engineering;2024-09-08

2. The Limitations of Data, Machine Learning and Us;Companion of the 2024 International Conference on Management of Data;2024-06-09

3. Enhanced machine learning models development for flash flood mapping using geospatial data;Euro-Mediterranean Journal for Environmental Integration;2024-05-31

4. Empirical analysis of performance assessment for imbalanced classification;Machine Learning;2024-01-23

5. TISBE: A Public Web Platform for the Consensus-Based Explainable Prediction of Developmental Toxicity;Chemical Research in Toxicology;2024-01-10