Impact of Imbalanced Datasets Preprocessing in the Performance of Associative Classifiers

Published:2020-04-16 Issue:8 Volume:10 Page:2779
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Rangel-Díaz-de-la-Vega Adolfo, Villuendas-Rey Yenny, Yáñez-Márquez Cornelio, Camacho-Nieto Oscar, López-Yáñez Itzamá

Abstract

In this paper, an experimental study was carried out to determine the influence of imbalanced dataset preprocessing on the performance of associative classifiers, in order to find the best computational solutions to the problem of credit scoring. To do this, six undersampling algorithms, six oversampling algorithms and four hybrid algorithms were evaluated on 13 imbalanced credit scoring datasets. Then, the performance of four associative classifiers was analyzed. The experiments allowed us to determine which sampling algorithms had the best results, as well as their impact on the associative classifiers evaluated. Accordingly, we determined that the Hybrid Associative Classifier with Translation, the Extended Gamma Associative Classifier and the Naïve Associative Classifier do not improve their performance when sampling algorithms are used to balance credit data. On the other hand, the Smallest Normalized Difference Associative Memory classifier benefited from oversampling and hybrid algorithms.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/10/8/2779/pdf

References: 77 articles.

1. On Class Imbalance Correction for Classification Algorithms in Credit Scoring;Bischl,2014

2. On the use of data filtering techniques for credit risk prediction with instance-based models

3. On the suitability of resampling techniques for the class imbalance problem in credit scoring

4. Sample selection bias in credit scoring models

5. Deep Neural Network Approach in Human-Like Redundancy Optimization for Anthropomorphic Manipulators

Cited by 4 articles.

1. Benchmarking state-of-the-art imbalanced data learning approaches for credit scoring;Expert Systems with Applications;2023-03

2. Special Issue on Data Preprocessing in Pattern Recognition: Recent Progress, Trends and Applications;Applied Sciences;2022-08-30

3. Hybrid data selection with preservation rough sets;Soft Computing;2022-08-25

4. Correlation Assessment of the Performance of Associative Classifiers on Credit Datasets Based on Data Complexity Measures;Mathematics;2022-04-26