Adaptive Initialization Method for K-Means Algorithm

Published:2021-11-25 Volume:4
ISSN:2624-8212
Container-title:Frontiers in Artificial Intelligence
Short-container-title:Front. Artif. Intell.

Author:

Yang Jie, Wang Yu-Kai, Yao Xin, Lin Chin-Teng

Abstract

The K-means algorithm is a widely used clustering algorithm that offers simplicity and efficiency. However, the traditional K-means algorithm chooses its initial cluster centers at random, which makes clustering results prone to local optima and degrades clustering performance. In this research, we propose an adaptive initialization method for the K-means algorithm (AIMK), which adapts to the varying characteristics of different datasets and obtains better clustering performance with stable results. For larger or higher-dimensional datasets, we additionally apply random sampling in AIMK (named AIMK-RS) to reduce the time complexity. Twenty-two real-world datasets were used for performance comparisons. The experimental results show that AIMK and AIMK-RS outperform current initialization methods and several well-known clustering algorithms. Specifically, AIMK-RS significantly reduces the time complexity to O(n). Moreover, we use AIMK to initialize K-medoids and spectral clustering, and better performance is also observed. These results demonstrate the superior performance and good scalability of AIMK and AIMK-RS. In the future, we would like to apply AIMK to more partition-based clustering algorithms to solve real-life practical problems.
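
The sensitivity to random initialization that the abstract describes can be illustrated with a minimal sketch. This is not AIMK or AIMK-RS, whose details are not given on this page; it is plain Lloyd-style K-means with uniformly random initial centers, written in standard-library Python. The function names (`squared_distance`, `kmeans`), the synthetic data, and the seed range are all assumptions made for illustration.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points, k, seed, max_iter=100):
    """Lloyd-style K-means with random initial centers (illustrative only).

    Returns (centers, sse), where sse is the sum of squared distances
    from each point to its nearest center.
    """
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # random initialization
    for _ in range(max_iter):
        # Assignment step: attach every point to its nearest center.
        clusters = [[] for _ in range(k)]
        for p in points:
            j = min(range(k), key=lambda i: squared_distance(p, centers[i]))
            clusters[j].append(p)
        # Update step: move each center to the mean of its members;
        # an empty cluster keeps its previous center.
        new_centers = [
            tuple(sum(c) / len(members) for c in zip(*members))
            if members else centers[i]
            for i, members in enumerate(clusters)
        ]
        if new_centers == centers:
            break
        centers = new_centers
    sse = sum(min(squared_distance(p, c) for c in centers) for p in points)
    return centers, sse


# Hypothetical synthetic data: three Gaussian blobs in 2-D.
data_rng = random.Random(0)
blob_means = [(0.0, 0.0), (6.0, 0.0), (3.0, 5.0)]
points = [
    (data_rng.gauss(mx, 1.0), data_rng.gauss(my, 1.0))
    for mx, my in blob_means
    for _ in range(50)
]

# Run the same algorithm from several random starts and compare the SSE.
sses = [kmeans(points, k=3, seed=s)[1] for s in range(10)]
print("best SSE:", min(sses), "worst SSE:", max(sses))
```

Any gap between the best and worst SSE across seeds reflects runs that converged to different local optima, which is the instability an initialization method aims to reduce.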

Publisher

Frontiers Media SA

References: 37 articles.

1. OPTICS: Ordering Points to Identify the Clustering Structure;Ankerst,1999

2. K-Means++: The Advantages of Careful Seeding;Arthur,2007

3. FCM: The Fuzzy C-Means Clustering Algorithm;Bezdek;Comput. Geosci.,1984

4. Graph K-Means Based on Leader Identification, Dynamic Game, and Opinion Dynamics;Bu;IEEE Trans. Knowl. Data Eng.,2020

5. An Initialization Method for the K-Means Algorithm Using Neighborhood Model;Cao;Comput. Maths. Appl.,2009

Cited by 13 articles.

1. Constructing a Hospital Department Development–Level Assessment Model: Machine Learning and Expert Consultation Approach in Complex Hospital Data Environments;JMIR Formative Research;2024-09-04

2. Enhanced Adjacency-Constrained Hierarchical Clustering Using Fine-Grained Pseudo Labels;IEEE Transactions on Emerging Topics in Computational Intelligence;2024-06

3. Cooperative Coverage Path Planning for Multi-Mobile Robots Based on Improved K-Means Clustering and Deep Reinforcement Learning;Electronics;2024-02-29

4. Exploring the spatiotemporal relationship between influenza and air pollution in Fuzhou using spatiotemporal weighted regression model;Scientific Reports;2024-02-19

5. Classification and clustering;Decision-Making Models;2024