Affiliation:
1. 1 Department of Fundamental Courses , Chengdu Textile College , Chengdu , Sichuan , , China .
Abstract
Abstract
In this paper, firstly, the fuzzy mathematical model and its types and algorithms of fuzzy numbers are studied. Then, the concept of fuzzy mathematics is added to cluster analysis, and fuzzy cluster analysis is carried out on samples or objects after data standardization and the construction of a similarity matrix. Then, the fuzzy C-mean FCM algorithm is proposed, and the FCM algorithm is improved by introducing the point density of data objects, the maximum minimum distance method of point density sampling, and the reduction of computation to improve the statistical function of clustering high-dimensional data. Finally, the algorithm of this paper is compared with other algorithms by AC, PR, RE, convergence speed, running time, DBI index and other indexes in order to analyze the advantages of the FCM algorithm in high dimensional data clustering statistics. The results show that the F value of the FCM algorithm in the Zoo dataset reaches 0.976, which improves 61% compared to FKM and nearly 12.6% compared to the IIFKM0 algorithm, with the best clustering effect. The DBI index of the FCM algorithm is the lowest in the four datasets, which is basically stabilized in the range of 0.5-0.6. This is better than other algorithms.
Subject
Applied Mathematics,Engineering (miscellaneous),Modeling and Simulation,General Computer Science
Reference22 articles.
1. Hu, S., Wang, R., & Ye, Y. (2021). Interactive information bottleneck for high-dimensional co-occurrence data clustering. Applied Soft Computing(111-), 111.
2. Bouveyron, C., Girard, S., & Schmid, C. (2007). High-dimensional data clustering. Computational Statistics & Data Analysis, 52(1), 502-519.
3. Mardia, K. V., Wiechers, H., Eltzner, B., & Huckemann, S. F. (2022). Principal component analysis and clustering on manifolds. Journal of Multivariate Analysis, 188.
4. C, M. D. A. B., A, E. F., D, A. L. A., & D, A. R. A. (2021). Automatic topography of high-dimensional data sets by non-parametric density peak clustering - sciencedirect. Information Sciences.
5. Kim, Y., Telea, A. C., Trager, S. C., & Roerdink, J. B. (2022). Visual cluster separation using high-dimensional sharpened dimensionality reduction:. Information Visualization, 21(3), 197-219.