Enhancement of K-means clustering in big data based on equilibrium optimizer algorithm

Published:2023-01-01 Issue:1 Volume:32
ISSN:2191-026X
Container-title:Journal of Intelligent Systems
language:en
Author:

Al-kababchee Sarah Ghanim Mahmood¹,², Algamal Zakariya Yahya³,⁴, Qasim Omar Saber¹

Affiliation:

1. Department of Mathematics, University of Mosul, 41002 Mosul, Iraq

2. Department of Mathematics, Education College, University of AL-Hamdaniya, 41019 Bartella, Iraq

3. Department of Statistics and Informatics, University of Mosul, 41002 Mosul, Iraq

4. College of Engineering, University of Warith Al-Anbiyaa, 56001 Karbala, Iraq

Abstract

Clustering, one of the primary methods of data mining, has many applications, including gene analysis. Clustering is an unsupervised learning problem in which a set of unlabeled data is divided into clusters according to the features of the data, so that the data in one cluster are more similar to one another than to the data in other clusters. However, the performance of the K-means algorithm depends directly on the number of clusters. Finding the best solutions to such real-world optimization problems requires techniques that explore the search space thoroughly. This research proposes an enhancement of K-means clustering based on the equilibrium optimizer algorithm. The proposed approach adjusts the number of clusters while simultaneously selecting the best attributes in order to find the optimal solution. Results on five datasets show that the proposed method outperforms existing algorithms, including the traditional methods, in terms of both intra-cluster distances and the Rand index. In conclusion, the proposed technique can be successfully employed for data clustering and can offer significant support.
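
The two evaluation criteria named in the abstract can be illustrated with a small sketch. The Python below is not the authors' implementation. It assumes that the intra-cluster distance is the sum of Euclidean distances from each point to its cluster centroid (the paper may define it differently), and the function names `intra_cluster_distance` and `rand_index` are hypothetical. The Rand index is computed in its standard form, as the fraction of point pairs on which two labelings agree.

```python
from itertools import combinations
from math import dist  # Euclidean distance, Python 3.8+


def intra_cluster_distance(points, labels):
    """Sum of Euclidean distances from each point to its cluster centroid.

    Assumed definition for illustration; lower values mean tighter clusters.
    """
    clusters = {}
    for point, label in zip(points, labels):
        clusters.setdefault(label, []).append(point)

    total = 0.0
    for members in clusters.values():
        # Centroid = coordinate-wise mean of the cluster members.
        centroid = [sum(coord) / len(members) for coord in zip(*members)]
        total += sum(dist(point, centroid) for point in members)
    return total


def rand_index(labels_true, labels_pred):
    """Fraction of point pairs on which the two labelings agree.

    A pair agrees when both labelings put the two points in the same
    cluster, or both put them in different clusters.
    """
    pairs = list(combinations(range(len(labels_true)), 2))
    agreements = sum(
        (labels_true[i] == labels_true[j]) == (labels_pred[i] == labels_pred[j])
        for i, j in pairs
    )
    return agreements / len(pairs)


if __name__ == "__main__":
    # Toy data: two well-separated groups of two points each.
    points = [(0, 0), (0, 2), (10, 0), (10, 2)]
    truth = [0, 0, 1, 1]
    print(intra_cluster_distance(points, truth))
    print(rand_index(truth, [0, 0, 1, 1]))
    print(rand_index(truth, [0, 1, 1, 1]))
```

A lower intra-cluster distance and a Rand index closer to 1 both indicate a better clustering, which is the sense in which the abstract compares methods.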

Publisher

Walter de Gruyter GmbH

Subject

Artificial Intelligence,Information Systems,Software

Link

https://www.degruyter.com/document/doi/10.1515/jisys-2022-0230/pdf

Cited by 5 articles.

1. An ordered subsets orthogonal nonnegative matrix factorization framework with application to image clustering;International Journal of Machine Learning and Cybernetics;2024-08-30

2. Sparse kernel k-means clustering;Journal of Applied Statistics;2024-06-05

3. DBGSA: A novel data adaptive bregman clustering algorithm;Engineering Applications of Artificial Intelligence;2024-05

4. Local Adaptive Clustering Based Image Matching for Automatic Visual Identification;2023 China Automation Congress (CAC);2023-11-17

5. Extended ADMM for general penalized quantile regression with linear constraints in big data;Communications in Statistics - Simulation and Computation;2023-08-31