Abstract
Among many clustering algorithms, the K-means clustering algorithm is widely used because of its simple algorithm and fast convergence. However, the K-value of clustering needs to be given in advance and the choice of K-value directly affect the convergence result. To solve this problem, we mainly analyze four K-value selection algorithms, namely Elbow Method, Gap Statistic, Silhouette Coefficient, and Canopy; give the pseudo code of the algorithm; and use the standard data set Iris for experimental verification. Finally, the verification results are evaluated, the advantages and disadvantages of the above four algorithms in a K-value selection are given, and the clustering range of the data set is pointed out.
Subject
Psychiatry and Mental health
Reference19 articles.
1. K-means text clustering algorithm based on centers selection according to maximum distance;Zhai;Appl. Res. Comput.,2014
2. Clustering Algorithms Research
3. The parallel implementation and application of an improved k-means algorithm;Li;J. Univ. Electron. Sci. Technol. China,2017
4. An efficient k-means clustering algorithm: analysis and implementation
Cited by
481 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献