An Improved K-Means Algorithm Based on Evidence Distance

Published: 2021-11-21 Issue: 11 Volume: 23 Page: 1550
ISSN: 1099-4300
Container-title: Entropy
Language: en

Author:

Zhu Ailin, Hua Zexi, Shi Yu, Tang Yongchuan, Miao Lingwei

Abstract

The clustering quality of the k-means algorithm depends mainly on the selection of the initial cluster centers and on the distance measure between sample points. The traditional k-means algorithm measures the distance between sample points with the Euclidean distance, which discriminates poorly between the attributes of sample points and makes the algorithm prone to locally optimal solutions. To address this problem, this paper proposes an improved k-means algorithm based on evidence distance. First, the attribute values of each sample point are modelled as its basic probability assignment (BPA). Then the Euclidean distance is replaced by the evidence distance as the measure between sample points. Finally, k-means clustering is carried out on UCI data sets. The method is compared experimentally with the traditional k-means algorithm, the k-means algorithm based on the aggregation distance parameter, and the Gaussian mixture model. The results show that the proposed algorithm achieves a better clustering effect and better convergence.
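To make the distance-replacement step concrete, here is a minimal Python sketch of an evidence distance between two BPAs and of the nearest-center assignment that k-means would perform with it. It assumes the evidence distance is the Jousselme distance, d(m1, m2) = sqrt(0.5 * (m1 - m2)^T D (m1 - m2)), where D holds the Jaccard similarity between focal elements. The abstract does not say how attribute values are turned into BPAs, so the BPAs here are simply given as dictionaries. The names `evidence_distance` and `nearest_center` are hypothetical, and this is not the authors' implementation.

```python
import math

def jaccard(a, b):
    """Jaccard similarity |a & b| / |a | b| between two focal elements (frozensets)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0

def evidence_distance(m1, m2):
    """Jousselme-style distance between two BPAs (assumed form, see lead-in).

    Each BPA is a dict mapping a frozenset (focal element) to its mass.
    """
    focal = list(set(m1) | set(m2))
    diff = [m1.get(f, 0.0) - m2.get(f, 0.0) for f in focal]
    # Quadratic form diff^T D diff, with D[i][j] = jaccard(focal[i], focal[j]).
    quad = sum(
        diff[i] * diff[j] * jaccard(focal[i], focal[j])
        for i in range(len(focal))
        for j in range(len(focal))
    )
    # Guard against tiny negative values caused by floating-point rounding.
    return math.sqrt(max(0.5 * quad, 0.0))

def nearest_center(sample, centers):
    """Index of the center closest to `sample` under the evidence distance."""
    return min(range(len(centers)), key=lambda k: evidence_distance(sample, centers[k]))

# Illustrative BPAs over a hypothetical frame {a, b}.
A, B, AB = frozenset("a"), frozenset("b"), frozenset("ab")
centers = [{A: 1.0}, {B: 1.0}]
sample = {A: 0.8, AB: 0.2}
assigned = nearest_center(sample, centers)
```

In this sketch, two BPAs that commit all mass to disjoint singletons are at distance 1, while overlapping focal elements such as {a} and {a, b} are closer. Note that if every BPA places mass only on singletons, D becomes the identity matrix and the measure is just a scaled Euclidean distance.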

Funder

National Key Research and Development Project of China

Publisher

MDPI AG

Subject

General Physics and Astronomy

Link

https://www.mdpi.com/1099-4300/23/11/1550/pdf

Cited by 9 articles.

1. An Improved K-Means Algorithm Based on Contour Similarity;Mathematics;2024-07-15

2. Refined intelligent manufacturing enterprise human management based on IoT and machine learning technology;The International Journal of Advanced Manufacturing Technology;2024-01-06

3. Research on the Construction of Digital Economy Index System Based on K-means-SA Algorithm;SAGE Open;2023-10

4. K-means Clustering Algorithm based on Improved Density Peak;Proceedings of the 2023 3rd International Conference on Bioinformatics and Intelligent Computing;2023-02-10

5. IMPT of head and neck cancer: unsupervised machine learning treatment planning strategy for reducing radiation dermatitis;Radiation Oncology;2023-01-14