The k-means Algorithm: A Comprehensive Survey and Performance Evaluation

Published: 2020-08-12
Volume: 9
Issue: 8
Page: 1295
ISSN: 2079-9292
Container-title: Electronics
Language: en

Author:

Ahmed Mohiuddin, Seraj Raihan, Islam Syed Mohammed Shamsul

Abstract

The k-means clustering algorithm is considered one of the most powerful and popular data mining algorithms in the research community. However, despite its popularity, the algorithm has certain limitations, including problems associated with random initialization of the centroids, which can lead to unexpected convergence. Additionally, the algorithm requires the number of clusters to be defined beforehand, a choice that influences the resulting cluster shapes and the effect of outliers. Another fundamental problem of the k-means algorithm is its inability to handle various data types. This paper provides a structured and synoptic overview of research conducted on the k-means algorithm to overcome such shortcomings. Variants of the k-means algorithm, including recent developments, are discussed, and their effectiveness is investigated through experimental analysis on a variety of datasets. The detailed experimental analysis, along with a thorough comparison among different k-means clustering algorithms, differentiates our work from other existing survey papers. Furthermore, it provides a clear and thorough understanding of the k-means algorithm along with its different research directions.
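
The initialization issue mentioned in the abstract can be illustrated with a small sketch. The code below is not taken from the paper; it is a minimal pure-Python version of the standard Lloyd iteration, and the names `kmeans`, `random_init`, and `squared_distance` are hypothetical helpers chosen for illustration. It assumes 2-D points stored as tuples and a number of clusters fixed in advance, and it shows that two different starting centroid sets on the same data can converge to different solutions.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def random_init(points, k, seed):
    """Illustrative random initialization: pick k distinct data points."""
    return random.Random(seed).sample(points, k)


def kmeans(points, centroids, max_iter=100):
    """Run Lloyd's iteration from the given starting centroids.

    Returns (centroids, labels, inertia), where inertia is the sum of
    squared distances from each point to its assigned centroid.
    """
    centroids = [tuple(c) for c in centroids]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centroid.
        new_labels = [
            min(range(len(centroids)),
                key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the run has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = tuple(sum(c) / len(members)
                                     for c in zip(*members))
    inertia = sum(squared_distance(p, centroids[lab])
                  for p, lab in zip(points, labels))
    return centroids, labels, inertia


# Four corners of a 4 x 1 rectangle, with k = 2.
data = [(0, 0), (0, 1), (4, 0), (4, 1)]

# Start from one point on each short side: left/right split.
_, good_labels, good_inertia = kmeans(data, [(0, 0), (4, 0)])

# Start from both points on the same short side: top/bottom split.
_, bad_labels, bad_inertia = kmeans(data, [(0, 0), (0, 1)])

print(good_labels, good_inertia)  # [0, 0, 1, 1] 1.0
print(bad_labels, bad_inertia)    # [0, 1, 0, 1] 16.0
```

Both runs converge, but the second stops at a poorer local optimum. A call such as `kmeans(data, random_init(data, 2, seed=0))` can land on either outcome depending on the seed, which is the sensitivity the abstract describes.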

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/9/8/1295/pdf

References (76 articles)

1. Foundations of Machine Learning;Mohri,2012

2. Neural Networks for Pattern Recognition;Bishop,1995

3. Data clustering

4. An Unsupervised Approach of Knowledge Discovery from Big Data in Social Network

Cited by 609 articles.

1. Multi-objective evolutionary algorithm based on transfer learning and neural networks: Dual operator feature fusion and weight vector adaptation;Information Sciences;2025-01

2. Street landscape environment design based on visual technology and entertainment robots: Computer simulation gamification landscape design;Entertainment Computing;2025-01

3. Enhancing autonomous pavement crack detection: Optimizing YOLOv5s algorithm with advanced deep learning techniques;Measurement;2025-01

4. Optimization of temperature uniformity for multi microwave source alternating heating based on superpermutation planning enhanced by Harris Hawks optimization;International Journal of Thermal Sciences;2024-12

5. Sentiment analysis based on text information enhancement and multimodal feature fusion;Pattern Recognition;2024-12