Redundancy Is Not Necessarily Detrimental in Classification Problems

Published:2021-11-15 Issue:22 Volume:9 Page:2899
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Grillo Sebastián Alberto, Noguera José Luis Vázquez, Mello Román Julio César, García-Torres Miguel, Facon Jacques, Pinto-Roa Diego P., Salgueiro Romero Luis, Gómez-Vela Francisco, Paniagua Laura Raquel Bareiro, Correa Deysi Natalia Leguizamon

Abstract

In feature selection, redundancy is a major concern because removing redundant features is tied to dimensionality reduction. Despite the evidence for this connection, few works study redundancy theoretically. In this work, we analyze the effect of redundant features on the performance of classification models. Our contributions are as follows: (i) we develop a theoretical framework to analyze feature construction and selection, (ii) we show that certain properly defined features are redundant yet make the data linearly separable, and (iii) we propose a formal criterion to validate feature construction methods. The experimental results suggest that a large number of redundant features can reduce the classification error. These results imply that it is not enough to analyze features solely with criteria that measure the amount of information they provide.
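As a rough illustration of point (ii), the sketch below uses the classic XOR toy problem, which is not taken from the paper. The product feature x1*x2 is a deterministic function of the two original features, so it adds no new information, yet it makes the four points linearly separable. The function name train_perceptron, the epoch budget, and the data set are assumptions chosen for this example only; the paper's own framework and experiments may differ.

```python
def train_perceptron(X, y, epochs=1000):
    """Plain perceptron for labels in {-1, +1}.

    Returns (weights, bias, converged). `converged` is True only if a
    full pass over the data produced no misclassification.
    """
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        mistakes = 0
        for xi, yi in zip(X, y):
            score = sum(wj * xj for wj, xj in zip(w, xi)) + b
            if yi * score <= 0:  # misclassified or on the boundary
                w = [wj + yi * xj for wj, xj in zip(w, xi)]
                b += yi
                mistakes += 1
        if mistakes == 0:
            return w, b, True
    return w, b, False


# XOR: not linearly separable in the original two features.
X_raw = [(0, 0), (0, 1), (1, 0), (1, 1)]
y = [-1, 1, 1, -1]

# Add a redundant feature: x1 * x2 is fully determined by x1 and x2.
X_aug = [(x1, x2, x1 * x2) for x1, x2 in X_raw]

_, _, raw_ok = train_perceptron(X_raw, y)
w_aug, b_aug, aug_ok = train_perceptron(X_aug, y)

print("separable with original features:", raw_ok)
print("separable with redundant feature added:", aug_ok)
```

The perceptron is used here only as a convenient separability check: it stops with zero mistakes exactly when it has found a separating hyperplane, which it cannot do on the raw XOR points.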

Funder

Consejo Nacional de Ciencia y Tecnología

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/9/22/2899/pdf
