Abstract
Determining the optimal feature set is a challenging problem, especially in an unsupervised setting. To address it, this paper presents a new unsupervised feature selection method, termed densest feature graph augmentation with disjoint feature clusters. The proposed method works in two phases: the first phase finds the maximally non-redundant feature subset, and the second phase adds disjoint features to that subset. To validate the method experimentally, its efficiency has been compared against five existing unsupervised feature selection methods on five UCI datasets in terms of three performance criteria, namely clustering accuracy, normalized mutual information, and classification accuracy. The experimental analyses show that the proposed method outperforms the considered methods.
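Two of the three criteria named above, clustering accuracy and normalized mutual information, can be illustrated with a minimal sketch. This is not the paper's evaluation code. The function names are hypothetical, the accuracy uses a brute-force search over one-to-one cluster-to-class mappings (practical only for a handful of clusters), and the NMI is normalized by the geometric mean of the two entropies, which is one common convention and an assumption here.

```python
from collections import Counter
from itertools import permutations
from math import log, sqrt


def clustering_accuracy(true_labels, cluster_labels):
    """Fraction of samples matched under the best one-to-one cluster-to-class mapping."""
    classes = sorted(set(true_labels))
    clusters = sorted(set(cluster_labels))
    # Pad with None so every cluster can receive a distinct class (or no class).
    if len(clusters) > len(classes):
        classes = classes + [None] * (len(clusters) - len(classes))
    pair_counts = Counter(zip(cluster_labels, true_labels))
    best = 0
    # Brute force over mappings: fine for a few clusters, too slow for many.
    for mapping in permutations(classes, len(clusters)):
        hits = sum(pair_counts[(c, k)] for c, k in zip(clusters, mapping))
        best = max(best, hits)
    return best / len(true_labels)


def entropy(labels):
    """Shannon entropy (natural log) of a label sequence."""
    n = len(labels)
    return -sum((c / n) * log(c / n) for c in Counter(labels).values())


def normalized_mutual_information(a, b):
    """NMI with geometric-mean normalization (an assumed convention)."""
    n = len(a)
    count_a, count_b = Counter(a), Counter(b)
    joint = Counter(zip(a, b))
    mi = sum(
        (nij / n) * log(n * nij / (count_a[i] * count_b[j]))
        for (i, j), nij in joint.items()
    )
    h_a, h_b = entropy(a), entropy(b)
    if h_a == 0 and h_b == 0:
        return 1.0  # both labelings are a single group: identical partitions
    if h_a == 0 or h_b == 0:
        return 0.0  # one labeling carries no information
    return mi / sqrt(h_a * h_b)


if __name__ == "__main__":
    truth = [0, 0, 1, 1, 2, 2]
    found = [1, 1, 0, 0, 2, 2]  # same partition, cluster ids permuted
    print(clustering_accuracy(truth, found))
    print(normalized_mutual_information(truth, found))
```

Both measures ignore how cluster ids are numbered, so the permuted labeling in the usage example scores as a perfect match. For larger numbers of clusters, the brute-force mapping search would normally be replaced by an assignment solver such as the Hungarian algorithm.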
Subject
Computational Mathematics, Computational Theory and Mathematics, Numerical Analysis, Theoretical Computer Science
Cited by
1 article.