Abstract
With the rapid development of science and technology, high-dimensional data have been widely used in various fields. Due to the complex characteristics of high-dimensional data, it is usually distributed in the union of several low-dimensional subspaces. In the past several decades, subspace clustering (SC) methods have been widely studied as they can restore the underlying subspace of high-dimensional data and perform fast clustering with the help of the data self-expressiveness property. The SC methods aim to construct an affinity matrix by the self-representation coefficient of high-dimensional data and then obtain the clustering results using the spectral clustering method. The key is how to design a self-expressiveness model that can reveal the real subspace structure of data. In this survey, we focus on the development of SC methods in the past two decades and present a new classification criterion to divide them into three categories based on the purpose of clustering, i.e., low-rank sparse SC, local structure preserving SC, and kernel SC. We further divide them into subcategories according to the strategy of constructing the representation coefficient. In addition, the applications of SC methods in face recognition, motion segmentation, handwritten digits recognition, and speech emotion recognition are introduced. Finally, we have discussed several interesting and meaningful future research directions.
Funder
National Natural Science Foundation of China
Project Funded by China Postdoctoral Science Foundation
Subject
General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference184 articles.
1. Driver, H.E., and Kroeber, A.L. (1932). Quantitative Expression of Cultural Relationships, University of California Press.
2. A technique for measuring like-mindedness;Zubin;J. Abnorm. Soc. Psychol.,1938
3. The description of personality: Basic traits resolved into clusters;Cattell;J. Abnorm. Soc. Psychol.,1943
4. MacQueen, J. (1965–7, January 27). Some methods for classification and analysis of multivariate observations. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, CA, USA.
5. Least squares quantization in PCM;Lloyd;IEEE Trans. Inf. Theory,1982
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Attributed graph subspace clustering with residual compensation guided by adaptive dual manifold regularization;Expert Systems with Applications;2024-12
2. Clustering Techniques in Data Mining: A Survey of Methods, Challenges, and Applications;Computer Science;2024-03-05
3. Constrained Hard-Soft Subspace Clustering;2023 3rd International Conference on Electronic Information Engineering and Computer (EIECT);2023-11-17
4. Semi-supervised Projected Subspace Clustering;2023 5th International Conference on Frontiers Technology of Information and Computer (ICFTIC);2023-11-17
5. Marginalized Graph Autoencoder with Subspace Structure Preserving;2023 4th International Conference on Big Data & Artificial Intelligence & Software Engineering (ICBASE);2023-08-25