Abstract
Clustering analysis is a significant technique in various fields, including unsupervised machine learning, data mining, pattern recognition, and image analysis. Many clustering algorithms are currently used, but almost all of them encounter various challenges, such as low accuracy, required number of clusters, slow processing, inability to produce non-spherical shaped clusters, and unstable performance with respect to data characteristics and size. In this research, a novel clustering algorithm called the fast component density clustering in spatial databases (FCDCSD) is proposed by utilizing a density-based clustering technique to address the aforementioned existing challenges. First, from the smallest to the largest point in the spatial field, each point is labeled with a temporary value, and the adjacent values in one component are stored in a set. Then, all sets with shared values are merged and resolved to obtain a single value that is representative of the merged sets. These values represent final cluster values; that is, the temporary equivalents in the dataset are replaced to generate the final clusters. If some noise appears, then a post-process is performed, and values are assigned to the nearest cluster based on a set of rules. Various synthetic datasets were used in the experiments to evaluate the efficiency of the proposed method. Results indicate that FCDCSD is generally superior to affinity propagation, agglomerative hierarchical, k-means, mean-shift, spectral, and density-based spatial clustering of applications with noise, ordering points for identifying clustering structures, and Gaussian mixture clustering methods.
Reference52 articles.
1. Explore unlabeled big data learning to online failure prediction in safety-aware cloud environment;Zhao;J. Parallel Distrib. Comput.,2021
2. A fast density peaks clustering algorithm with sparse search;Xu;Inf. Sci.,2021
3. Divide well to merge better: A novel clustering algorithm;Rehman;Pattern Recognit.,2022
4. Systematic review of unsupervised genomic clustering algorithms techniques for high dimensional datasets;Abdulazeez;Technol. Rep. Kansai Univ.,2020
5. A study of graph-based system for multi-view clustering;Wang;Knowledge-Based Syst.,2019
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献