Abstract
Density peaks clustering (DPC) algorithm can process data of any shape and is simple and intuitive. However, the distance between any two high-dimensional points tends to be consistent, which makes it difficult to distinguish the density peaks and easily produces “bad label” delivery. To surmount the above-mentioned defects, this paper put forward a novel density peaks clustering algorithm with isolation kernel and K-induction (IKDC). The IKDC uses an optimized isolation kernel instead of the traditional distance. The optimized isolation kernel solves the problem of converging the distance between the high-dimensional samples by increasing the similarity of two samples in a sparse domain and decreasing the similarity of two samples in a dense domain. In addition, the IKDC introduces three-way clustering, uses core domains to represent dense regions of clusters, and uses boundary domains to represent sparse regions of clusters, where points in the boundary domains may belong to one or more clusters. At the same time as determining the core domains, the improved KNN and average similarity are proposed to assign as many as possible to the core domains. The K-induction is proposed to assign the leftover points to the boundary domain of the optimal cluster. To confirm the practicability and validity of IKDC, we test on 10 synthetic and 8 real datasets. The comparison with other algorithms showed that the IKDC was superior to other algorithms in multiple clustering indicators.
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference54 articles.
1. Using Clustering as a Tool: Mixed Methods in Qualitative Data Analysis;Macia;Qual. Rep.,2015
2. Clustering high dimensional data;Assent;Wiley Interdiscip. Rev. Data Min. Knowl. Discov.,2012
3. Jiang, G., Wang, H., Peng, J., and Fu, X. (2022, January 27–30). Parallelism Network with Partial-aware and Cross-correlated Transformer for Vehicle Re-identification. Proceedings of the ICMR ’22: International Conference on Multimedia Retrieval, Newark, NJ, USA.
4. Hypergraph Matching via Game-Theoretic Hypergraph Clustering;Hou;Pattern Recognit.,2022
5. An enhanced Grey Wolf Optimizer based Particle Swarm Optimizer for intrusion detection system in wireless sensor networks;Otair;Wirel. Netw.,2022
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献