Affiliation:
1. Bhagwan Parshuram Institute of Technology, New Delhi, India
Abstract
This article describes how data is relevant and if it can be organized, linked with other data and grouped into a cluster. Clustering is the process of organizing a given set of objects into a set of disjoint groups called clusters. There are a number of clustering algorithms like k-means, k-medoids, normalized k-means, etc. So, the focus remains on efficiency and accuracy of algorithms. The focus is also on the time it takes for clustering and reducing overlapping between clusters. K-means is one of the simplest unsupervised learning algorithms that solves the well-known clustering problem. The k-means algorithm partitions data into K clusters and the centroids are randomly chosen resulting numeric values prohibits it from being used to cluster real world data containing categorical values. Poor selection of initial centroids can result in poor clustering. This article deals with a proposed algorithm which is a variant of k-means with some modifications resulting in better clustering, reduced overlapping and lesser time required for clustering by selecting initial centres in k-means and normalizing the data.
Reference29 articles.
1. Improved approximation algorithms for multilevel facility location problems
2. Analysis and Approach: K-Means and K-Medoids Data Mining Algorithms.;A.Batra;5th IEEE International Conference on Advanced Computing & Communication Technologies [ICACCT‐2011],2011
3. Blömer, J., Brauer, S., & Bujna, K. (2015). Complexity and approximation of the fuzzy k-means problem. arXiv:1512.05947
4. A fuzzy k-means clustering algorithm using cluster center displacement.;C. T.Chang;Journal of Information Science and Engineering,2011
5. An Improvement in K-mean Clustering Algorithm Using Better Time and Accuracy
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献