Feature Ranking and Screening for Class-Imbalanced Metabolomics Data Based on Rank Aggregation Coupled with Re-Balance-Reference-Cited by-同舟云学术

Feature Ranking and Screening for Class-Imbalanced Metabolomics Data Based on Rank Aggregation Coupled with Re-Balance

Published:2021-06-14 Issue:6 Volume:11 Page:389
ISSN:2218-1989
Container-title:Metabolites
language:en
Short-container-title:Metabolites

Author:

Fu Guang-Hui^ORCID,Wang Jia-Bao,Zong Min-Jie,Yi Lun-Zhao

Abstract

Feature screening is an important and challenging topic in current class-imbalance learning. Most of the existing feature screening algorithms in class-imbalance learning are based on filtering techniques. However, the variable rankings obtained by various filtering techniques are generally different, and this inconsistency among different variable ranking methods is usually ignored in practice. To address this problem, we propose a simple strategy called rank aggregation with re-balance (RAR) for finding key variables from class-imbalanced data. RAR fuses each rank to generate a synthetic rank that takes every ranking into account. The class-imbalanced data are modified via different re-sampling procedures, and RAR is performed in this balanced situation. Five class-imbalanced real datasets and their re-balanced ones are employed to test the RAR’s performance, and RAR is compared with several popular feature screening methods. The result shows that RAR is highly competitive and almost better than single filtering screening in terms of several assessing metrics. Performing re-balanced pretreatment is hugely effective in rank aggregation when the data are class-imbalanced.

Funder

the National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Molecular Biology,Biochemistry,Endocrinology, Diabetes and Metabolism

Link

https://www.mdpi.com/2218-1989/11/6/389/pdf

Reference57 articles.

1. Identifying Mislabeled Training Data

2. Data mining for imbalanced datasets: An overview;Chawla,2009

3. Learning from imbalanced data: open challenges and future directions

4. Imbalance: Oversampling algorithms for imbalanced classification in R

5. SMOTE: Synthetic Minority Over-sampling Technique

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Novel Deep Learning Framework for Intrusion Detection Systems in Wireless Network;Future Internet;2024-07-25

2. An adaptive loss backward feature elimination method for class-imbalanced and mixed-type data in medical diagnosis;Chemometrics and Intelligent Laboratory Systems;2023-05

3. An Improved Method of Polyp Detection Using Custom YOLOv4-Tiny;Applied Sciences;2022-10-26