Abstract
Kmeans clustering algorithm is an iterative unsupervised learning algorithm that tries to partition the given dataset into k pre-defined distinct non-overlapping clusters where each data point belongs to only one group. However, its performance is affected by its sensitivity to the initial cluster centroids with the possibility of convergence into local optimum and specification of cluster number as the input parameter. Recently, the hybridization of metaheuristics algorithms with the K-Means algorithm has been explored to address these problems and effectively improve the algorithm’s performance. Nonetheless, most metaheuristics algorithms require rigorous parameter tunning to achieve an optimum result. This paper proposes a hybrid clustering method that combines the well-known symbiotic organisms search algorithm with K-Means using the SOS as a global search metaheuristic for generating the optimum initial cluster centroids for the K-Means. The SOS algorithm is more of a parameter-free metaheuristic with excellent search quality that only requires initialising a single control parameter. The performance of the proposed algorithm is investigated by comparing it with the classical SOS, classical K-means and other existing hybrids clustering algorithms on eleven (11) UCI Machine Learning Repository datasets and one artificial dataset. The results from the extensive computational experimentation show improved performance of the hybrid SOSK-Means for solving automatic clustering compared to the standard K-Means, symbiotic organisms search clustering methods and other hybrid clustering approaches.
Publisher
Public Library of Science (PLoS)
Reference99 articles.
1. Hybrid Symbiotic Organism Search algorithms for Automatic Data Clustering;V. Rajah;Conf. Inf. Commun. Technol. Soc. ICTAS 2020—Proc.,2020
2. Automatic Data Clustering Using Hybrid Firefly Particle Swarm Optimization Algorithm;M. B. Agbaje;IEEE Access,2019
3. Simultaneous Pattern and Data Clustering for Pattern Cluster Analysis;A. K. C. Wong;IEEE Transactions on Knowledge and Data Engineering,2008
4. Cluster analysis and mathematical programming;P. Hansen,1997
5. Efficient and Effective Clustering Methods for Spatial Data Mining 1 Introduction;R. T. Ng;Proceedings of VLDB,1994
Cited by
16 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献