CDSMOTE: class decomposition and synthetic minority class oversampling technique for imbalanced-data classification-Reference-Cited by-同舟云学术

CDSMOTE: class decomposition and synthetic minority class oversampling technique for imbalanced-data classification

Published:2020-07-18 Issue:7 Volume:33 Page:2839-2851
ISSN:0941-0643
Container-title:Neural Computing and Applications
language:en
Short-container-title:Neural Comput & Applic

Author:

Elyan Eyad^ORCID,Moreno-Garcia Carlos Francisco,Jayne Chrisina

Abstract

AbstractClass-imbalanced datasets are common across several domains such as health, banking, security, and others. The dominance of majority class instances (negative class) often results in biased learning models, and therefore, classifying such datasets requires employing some methods to compact the problem. In this paper, we propose a new hybrid approach aiming at reducing the dominance of the majority class instances using class decomposition and increasing the minority class instances using an oversampling method. Unlike other undersampling methods, which suffer data loss, our method preserves the majority class instances, yet significantly reduces its dominance, resulting in a more balanced dataset and hence improving the results. A large-scale experiment using 60 public datasets was carried out to validate the proposed methods. The results across three standard evaluation metrics show the comparable and superior results with other common and state-of-the-art techniques.

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Software

Link

https://link.springer.com/content/pdf/10.1007/s00521-020-05130-z.pdf

Reference49 articles.

1. Barandela R, Sánchez JS, García V, Rangel E (2003) Strategies for learning in class imbalance problems. Pattern Recognit 36:849–851

2. Japkowicz N, Stephen S (2002) The class imbalance problem: a systematic study. Intell Data Anal 6(5):429–449

3. Kotsiantis S, Kanellopoulos D, Pintelas P (2006) Handling imbalanced datasets: a review. GESTS Int Trans Comput Sci Eng 30(1):25–36

4. Chawla NV (2005) Data mining for imbalanced datasets: an overview. In: Maimon O, Rokach L (eds) Data mining and knowledge discovery handbook. Springer, Boston, MA

5. Haixiang G, Yijing L, Shang J, Mingyun G, Yuanyue H (2017) Learning from class-imbalanced data: review of methods and applications. Expert Syst Appl 73:220–239

Cited by 67 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. SFPL: Sample-specific fine-grained prototype learning for imbalanced medical image classification;Medical Image Analysis;2024-10

2. Generating synthetic data with variational autoencoder to address class imbalance of graph attention network prediction model for construction management;Advanced Engineering Informatics;2024-10

3. CARBO: Clustering and rotation based oversampling for class imbalance learning;Knowledge-Based Systems;2024-09

4. A failure risk assessment method for lithium-ion batteries based on big data of after-sales vehicles;Engineering Failure Analysis;2024-09

5. Performance analysis of lung cancer detection and classification using efficientNet: a deep learning model;Multimedia Tools and Applications;2024-08-22