A composite entropy-based uncertainty measure guided attribute reduction for imbalanced mixed-type data-Reference-Cited by-同舟云学术

A composite entropy-based uncertainty measure guided attribute reduction for imbalanced mixed-type data

Published:2024-03-05 Issue:3 Volume:46 Page:7307-7325
ISSN:1064-1246
Container-title:Journal of Intelligent & Fuzzy Systems
language:
Short-container-title:IFS

Author:

Shu Wenhao¹,Li Shipeng¹,Qian Wenbin²

Affiliation:

1. School of Information Engineering, East China Jiaotong University, Nanchang, Jiangxi, China

2. School of Software, Jiangxi Agriculture University, Nanchang, Jiangxi, China

Abstract

In real-world scenarios, datasets generally exhibit containing mixed-type of attributes and imbalanced classes distribution, and the minority classes in the data are the primary research focus. Attribute reduction is a key step in the data preprocessing process, but traditional attribute reduction methods commonly overlook the significance of minority class samples, causing the critical information possessed in minority class samples to damage and decrease the performance of classification. In order to address this issue, we develop an attribute reduction algorithm based on a composite entropy-based uncertainty measure to handle imbalanced mixed-type data. To begin with, we design a novel oversampling method based on the three-way decisions boundary region to synthesize the samples of minority class, for the boundary region to contain more high-quality samples. Then, we propose an attribute measure to select candidate attributes, which considers the boundary entropy, degree of dependency and weight of classes. On this basis, a composite entropy-based uncertainty measure guided attribute reduction algorithm is developed to select the attribute subset for the imbalanced mixed-type data. Experimental on UCI imbalanced datasets, as well as the results indicate that the developed attribute reduction algorithm is significantly outperforms compared to other attribute reduction algorithms, especially in total AUC, F1-Score and G-Mean.

Publisher

IOS Press

Reference51 articles.

1. Tri-level attribute reduction in rough set theory;Zhang;Expert Systems with Applications,2022

2. Feature selection based on multiview entropy measures inmultiperspective rough set;Xu;International Journal of Intelligent Systems,2022

3. Variable radius neighborhood rough sets and attribute reduction;Zhang;International Journalof Approximate Reasoning,2022

4. Neighborhood rough set based heterogeneous feature subset selection;Hu;Information Sciences,2008

5. A class-specific feature selection and classification approach usingneighborhood rough set and k-nearest neighbor theories;Sewwandi;Applied Soft Computing,2023

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A dynamic attribute reduction algorithm based on relative neighborhood discernibility degree;Scientific Reports;2024-07-08