A Fast Algorithm to Initialize Cluster Centroids in Fuzzy Clustering Applications

Published: 2020-09-15 Issue: 9 Volume: 11 Page: 446
ISSN: 2078-2489
Container-title: Information
Language: en
Short-container-title: Information

Author:

Cebeci Zeynel, Cebeci Cagatay

Abstract

The goal of partitioning clustering analysis is to divide a dataset into a predetermined number of homogeneous clusters. The quality of the final clusters produced by a prototype-based partitioning algorithm depends heavily on the initially chosen centroids. In this paper, we propose InoFrep, a novel data-dependent initialization algorithm for improving computational efficiency and robustness in prototype-based hard and fuzzy clustering. InoFrep is a single-pass algorithm that uses the frequency polygon data of the feature with the highest peak count in a dataset. Using the Fuzzy C-means (FCM) clustering algorithm, we empirically compare the performance of InoFrep on one synthetic and six real datasets with that of two common initialization methods: random sampling of data points and K-means++. Our results show that InoFrep significantly reduces the number of iterations and the computing time required by the FCM algorithm. Additionally, it can be applied to large multidimensional datasets because of its shorter initialization time and its independence from dimensionality, which results from working with only the single feature that has the highest number of peaks.
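The abstract does not spell out the steps of InoFrep, so the following is only a minimal sketch of the general idea it describes: build a frequency polygon (histogram bin midpoints and counts) for each feature, pick the feature with the most peaks, and seed the centroids from the highest peaks of that feature. The function names `find_peaks` and `init_centroids_by_peaks`, the `bins` parameter, the peak definition, the fallback when fewer than `k` peaks exist, and the choice of the nearest data row as the centroid are all assumptions made for illustration and should not be read as the authors' published procedure.

```python
import numpy as np


def find_peaks(counts):
    """Indices of bins higher than their left neighbour and not lower than
    their right neighbour (an assumed, simple peak definition)."""
    padded = np.concatenate(([0], counts, [0]))
    higher_than_left = padded[1:-1] > padded[:-2]
    not_lower_than_right = padded[1:-1] >= padded[2:]
    return np.flatnonzero(higher_than_left & not_lower_than_right)


def init_centroids_by_peaks(X, k, bins=20):
    """Hypothetical peak-based initializer; returns (centroids, feature index).

    Assumes bins >= k. Only one feature is used to choose the seed rows,
    which is why the cost of the selection step does not grow with the
    number of remaining features.
    """
    X = np.asarray(X, dtype=float)
    best = None
    for j in range(X.shape[1]):
        # Frequency polygon data: bin midpoints paired with bin counts.
        counts, edges = np.histogram(X[:, j], bins=bins)
        mids = (edges[:-1] + edges[1:]) / 2.0
        peaks = find_peaks(counts)
        # Keep the first feature that has the largest number of peaks.
        if best is None or len(peaks) > len(best[1]):
            best = (j, peaks, counts, mids)

    j, peaks, counts, mids = best
    # Rank the peaks by frequency and keep the k highest.
    chosen = peaks[np.argsort(counts[peaks])[::-1]][:k]
    if len(chosen) < k:
        # Assumed fallback: fill up with the most frequent non-peak bins.
        rest = np.setdiff1d(np.arange(len(counts)), chosen)
        rest = rest[np.argsort(counts[rest])[::-1]]
        chosen = np.concatenate((chosen, rest[: k - len(chosen)]))

    # Use the data row closest to each chosen midpoint as a full centroid.
    rows = [int(np.argmin(np.abs(X[:, j] - m))) for m in mids[chosen]]
    return X[rows], j
```

The returned rows could then be passed as the initial prototypes to any FCM or K-means implementation that accepts user-supplied starting centroids. Note that the fallback branch may select the same row twice when it draws on empty bins, so a production version would need to handle duplicates.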

Publisher

MDPI AG

Subject

Information Systems

Link

https://www.mdpi.com/2078-2489/11/9/446/pdf

References: 21 articles.

1. Prototype-based Classification and Clustering;Borgelt,2005

2. Introduction to Partitioning-based Clustering Methods with a Robust Example;Äyrämö,2006

3. A novel initialization scheme for the fuzzy c-means algorithm for color clustering

4. Introduction to five clustering algorithms;Moertini;Integral,2002

5. A Survey of Clustering Algorithms for Big Data: Taxonomy and Empirical Analysis

Cited by 3 articles.

1. Certain Investigation on Perpetualistic Fuzzy Outlier Data for Efficiency Evaluation of Centroid Stability with Cluster Boundary Fitness;Data Analytics and Artificial Intelligence;2023-01-01

2. Applying radial basis function neural network for comprehending properties of each cluster of fuzzy c-means in coordinates analysis (case study in Iran);2022-12-08

3. Scalable Fuzzy Clustering With Anchor Graph;IEEE Transactions on Knowledge and Data Engineering;2022