Trusting my predictions: on the value of Instance-Level analysis-Reference-Cited by-同舟云学术

Trusting my predictions: on the value of Instance-Level analysis

Published:2023-08-09 Issue: Volume: Page:
ISSN:0360-0300
Container-title:ACM Computing Surveys
language:en
Short-container-title:ACM Comput. Surv.

Author:

Lorena Ana C.¹,Paiva Pedro Y. A.¹,Prudêncio Ricardo B. C.²

Affiliation:

1. Instituto Tecnológico de Aeronáutica, Brazil

2. Centro de Informática, Universidade Federal de Pernambuco, Brazil

Abstract

Machine Learning solutions have spread along many domains, including critical applications. The development of such models usually relies on a dataset containing labeled data. This dataset is then split into training and test sets and the accuracy of the models in replicating the test labels is assessed. This process is often iterated in a cross-validation procedure for obtaining average performance estimates. But is the average of the predictive performance on test sets enough for assessing the trustfulness of a Machine Learning model? This paper discusses the importance of knowing which individual observations of a dataset are more challenging than others and how this characteristic can be measured and used in order to improve classification performance and trustfulness. A set of strategies for measuring the hardness level of the instances of a dataset is surveyed and a Python package containing their implementation is provided.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science,Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3615354

Reference48 articles.

1. Measuring Instance Hardness Using Data Complexity Measures

2. Assessing the data complexity of imbalanced datasets

3. Christopher M Bishop and Nasser M Nasrabadi . 2006. Pattern recognition and machine learning. Vol. 4 . Springer . Christopher M Bishop and Nasser M Nasrabadi. 2006. Pattern recognition and machine learning. Vol. 4. Springer.

4. On the appropriateness of Platt scaling in classifier calibration

5. Leo Breiman . 1996. Bagging predictors. Machine learning 24, 2 ( 1996 ), 123–140. Leo Breiman. 1996. Bagging predictors. Machine learning 24, 2 (1996), 123–140.

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Assessor Models for Explaining Instance Hardness in Classification Problems;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

2. Measuring Latent Traits of Instance Hardness and Classifier Ability using Boltzmann Machines;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30