Affiliation:
1. School of Computer Science and Technology, Anhui University of Technology, Ma’anshan 243032, China
Abstract
Since the Fuzzy C-Means algorithm is incapable of considering the influence of different features and exponential constraints on high-dimensional and complex data, a fuzzy clustering algorithm based on non-Euclidean distance combining feature weights and entropy weights is proposed. The proposed algorithm is based on the Fuzzy C-Means soft clustering algorithm to deal with high-dimensional and complex data. The objective function of the new algorithm is modified with the help of two different entropy terms and a non-Euclidean way of computing the distance. The distance calculation formula enhances the efficiency of extracting the contribution of different features. The first entropy term helps to minimize the clusters’ dispersion and maximize the negative entropy to control the clustering process, which also promotes the association between the samples. The second entropy term helps to control the weights of features since different features have different weights in the clustering process. Experiments on real-world datasets indicate that the proposed algorithm gives better clustering results than other algorithms. The experiments demonstrate the proposed algorithm’s robustness by analyzing the parameters’ sensitivity and comparing the computational distance formulas. In summary, the improved algorithm improves classification performance under noisy interference and high-dimensional datasets, increases computational efficiency, performs well in real-world high-dimensional datasets, and encourages the development of robust noise-resistant high-dimensional fuzzy clustering algorithms.
Subject
General Physics and Astronomy
Reference39 articles.
1. Data Mining: Concept, Aplications and Techniques;Wang;ASEAN J. Sci. Technol. Dev.,2000
2. An overview of statistical learning theory;Vapnik;IEEE Trans. Neural Netw.,1999
3. Patel, K.M.A., and Thakral, P. (2016, January 6–8). The best clustering algorithms in data mining. Proceedings of the 2016 International Conference on Communication and Signal Processing (ICCSP), IEEE, Melmaruvathur, India.
4. Spatial equilibrium of housing provident fund in China based on data mining cluster analysis;Jiang;Int. J. Wireless Mobile Comput.,2016
5. Model order reduction based on agglomerative hierarchical clustering;Wunsch;IEEE Trans. Neural Netw. Learn. Syst.,2018
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献