Affiliations:
1. Tel-Aviv University, Israel
2. Ben-Gurion University, Israel
Abstract
Dimensionality (i.e., the number of data set attributes or groups of attributes) constitutes a serious obstacle to the efficiency of most data mining algorithms (Maimon and Last, 2000). The main reason for this is that data mining algorithms are computationally intensive. This obstacle is sometimes known as the "curse of dimensionality" (Bellman, 1961). The objective of feature selection is to identify the important features in the data set and to discard all others as irrelevant or redundant. Since feature selection reduces the dimensionality of the data, data mining algorithms can run faster and more effectively. In some cases the performance of the data mining method improves as a result of feature selection, mainly because of a more compact, easily interpreted representation of the target concept. There are three main approaches to feature selection: wrapper, filter and embedded. The filter approach (Kohavi, 1995; Kohavi and John, 1996) operates independently of the data mining method employed subsequently: undesirable features are filtered out of the data before learning begins. These algorithms use heuristics based on general characteristics of the data to evaluate the merit of feature subsets. A sub-category of filter methods, referred to here as rankers, comprises methods that employ some criterion to score each feature and provide a ranking. From this ordering, several feature subsets can be chosen by manually setting a threshold. The wrapper approach (Kohavi, 1995; Kohavi and John, 1996) uses an inducer as a black box, along with a statistical re-sampling technique such as cross-validation, to select the best feature subset according to some predictive measure. The embedded approach (see, for instance, Guyon and Elisseeff, 2003) is similar to the wrapper approach in the sense that the features are selected for a specific inducer, but it selects the features during the process of learning.
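As a concrete illustration of the ranker sub-category described above, here is a minimal Python sketch. The scoring criterion (absolute Pearson correlation with the label), the function names rank_features and select_by_threshold, and the threshold value are illustrative assumptions; the abstract leaves the choice of ranking criterion open.

```python
import numpy as np

def rank_features(X, y):
    """Score each feature by |Pearson correlation| with the label and
    return feature indices ordered from most to least relevant.
    (One possible ranking criterion; others would work the same way.)"""
    scores = np.array([abs(np.corrcoef(X[:, j], y)[0, 1])
                       for j in range(X.shape[1])])
    order = np.argsort(scores)[::-1]  # best-scoring features first
    return order, scores

def select_by_threshold(order, scores, threshold):
    """Keep every feature whose score clears the manually set threshold."""
    return [j for j in order if scores[j] >= threshold]

# Toy data: 5 features, only the first two actually track the label.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(float)

order, scores = rank_features(X, y)
selected = select_by_threshold(order, scores, threshold=0.2)
print("ranking:", order, "selected:", selected)
```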
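The wrapper approach can be sketched in the same spirit: the inducer is treated as a black box and candidate subsets are scored by cross-validated accuracy. The choice of a decision tree as the inducer and of exhaustive search over k-feature subsets are assumptions made for brevity; in practice a greedy forward or backward search is more common.

```python
from itertools import combinations

import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

def wrapper_select(X, y, k):
    """Score every k-feature subset by 5-fold cross-validated accuracy
    of a black-box inducer and return the best-scoring subset."""
    inducer = DecisionTreeClassifier(random_state=0)
    best_subset, best_score = None, -np.inf
    for subset in combinations(range(X.shape[1]), k):
        score = cross_val_score(inducer, X[:, list(subset)], y, cv=5).mean()
        if score > best_score:
            best_subset, best_score = subset, score
    return best_subset, best_score

# Toy data: only features 0 and 1 carry signal.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(int)
print(wrapper_select(X, y, k=2))  # expected to recover (0, 1)
```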
References (35 articles)
1. Almuallim, H., & Dietterich, T. G. (1991). Learning with many irrelevant features. In Proceedings of the Ninth National Conference on Artificial Intelligence (pp. 547–552). MIT Press.
2. Bellman, R. (1961). Adaptive Control Processes: A Guided Tour. Princeton University Press.
3. Cherkauer, K. J., & Shavlik, J. W. (1996). Growing simpler decision trees to facilitate knowledge discovery. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining. AAAI Press.
4. Chizi, B., & Maimon, O. (2002). On dimensionality reduction of high dimensional data sets. In Frontiers in Artificial Intelligence and Applications (pp. 230–236). IOS Press.
Cited by 17 articles.