Trainable Undersampling for Class-Imbalance Learning-Reference-Cited by-同舟云学术

Trainable Undersampling for Class-Imbalance Learning

Published:2019-07-17 Issue: Volume:33 Page:4707-4714
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Peng Minlong,Zhang Qi,Xing Xiaoyu,Gui Tao,Huang Xuanjing,Jiang Yu-Gang,Ding Keyu,Chen Zhigang

Abstract

Undersampling has been widely used in the class-imbalance learning area. The main deficiency of most existing undersampling methods is that their data sampling strategies are heuristic-based and independent of the used classifier and evaluation metric. Thus, they may discard informative instances for the classifier during the data sampling. In this work, we propose a meta-learning method built on the undersampling to address this issue. The key idea of this method is to parametrize the data sampler and train it to optimize the classification performance over the evaluation metric. We solve the non-differentiable optimization problem for training the data sampler via reinforcement learning. By incorporating evaluation metric optimization into the data sampling process, the proposed method can learn which instance should be discarded for the given classifier and evaluation metric. In addition, as a data level operation, this method can be easily applied to arbitrary evaluation metric and classifier, including non-parametric ones (e.g., C4.5 and KNN). Experimental results on both synthetic and realistic datasets demonstrate the effectiveness of the proposed method.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 62 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A post-processing framework for class-imbalanced learning in a transductive setting;Expert Systems with Applications;2024-09

2. Class Imbalance Problem: A Wrapper-Based Approach using Under-Sampling with Ensemble Learning;Information Systems Frontiers;2024-08-29

3. Customised-sampling approach for pipe failure prediction in water distribution networks;Scientific Reports;2024-08-06

4. A Domain-Specific Tool for the Creation of Machine Learning Models with Imbalanced Datasets;2024 IEEE International Conference on Smart Computing (SMARTCOMP);2024-06-29

5. Automating the production of Machine Learning models for imbalanced datasets - application to Industry 4.0;2024 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events (PerCom Workshops);2024-03-11