A weighted k-mean clustering algorithm based on singular values with offset clustering centers

Author:

deng shaobo1,lin xing1,Yuan Weili1,Liao Zemin1,Guan Sujie1,Li Min1

Affiliation:

1. Nanchang Institute of Technology

Abstract

Abstract

The K-means algorithm is widely used for dataset clustering, but it does not consider the importance of each attribute dimension when dealing with feature attributes and clustering center selection, but rather treats all attributes as having equal importance. In order to solve this problem, this paper proposes a weighted k-mean clustering algorithm (SVW-KMeans) based on singular values with offset clustering centers. The algorithm calculates the weight information of the data points through singular value decomposition to focus on the most significant and most different features, joining the weight calculation to optimize the objective function, and at the same time, the weighted arithmetic mean of the individuals is used as the clustering center, and the clustering center is shifted towards the high importance so as to take into full consideration of the importance of the different features in the clustering process. The experimental results show that the SVW-KMeans algorithm outperforms other algorithms in clustering on synthetic and real datasets, which verifies that the SVW-KMeans algorithm outperforms other mainstream clustering algorithms in terms of clustering quality and stability.

Publisher

Springer Science and Business Media LLC

Reference22 articles.

1. MacQueen J (1967) Some methods for classification and analysis of multivariate observations[C] Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability. Oakland,USA:University of California Press,1(14):281–297

2. Zhaohong. Survey on challenges in clustering analysis research;Wang J;Control Decis,2012

3. rthur D, Vassilvitskii S (2007) K-means+ +:The advantages of careful seeding [C] Proceedings of the 18th Annual ACM-SIAM Symposium on Discrete Algorithms. Philadelphia, USA: Society for Industrial and Applied Mathematics,:1027–1035

4. The k-means-u* algorithm: non-local jumps and greedy retries improve k-means + + clustering;Fritzke B;CoRR,2017

5. Zhang,and Zhaoyang Qu. Visual Extraction of Refined Operation Mode of New Power System Based on IPSO-Kmeans;Guo X;Electronics,2023

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3