Affiliation:
1. School of Science, Hubei University of Technology, Wuhan 430068, China
2. School of Computer Science and Technology, Wuhan University of Bioengineering, Wuhan 430060, China
Abstract
This study aims to improve the clustering quality of the density peak clustering (DPC) algorithm and to address privacy protection during cluster analysis. To this end, a DPC algorithm based on the Chebyshev inequality and differential privacy (DP-CDPC) is proposed. First, for high-dimensional datasets, the distance matrix is computed with cosine distance instead of Euclidean distance, and the truncation distance is determined automatically by the dichotomy (bisection) method. Second, because selecting suitable cluster centers is difficult in the DPC algorithm, statistical constraints on the decision graph are constructed from the Chebyshev inequality, and cluster centers are selected by adjusting the constraint parameters. Finally, to limit privacy leakage during cluster analysis, the Laplace mechanism adds noise to the local density, which gives the algorithm differential privacy protection. The experimental results demonstrate that the DP-CDPC algorithm can effectively select the cluster centers, improve the quality of the clustering results, and provide good privacy protection.
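The abstract gives only the outline of the pipeline, so the following is a minimal pure-Python sketch of the four steps, not the authors' implementation. Several details are assumptions made for illustration: all function names are hypothetical, the bisection targets a mean neighbour fraction (`target_ratio`), the Laplace noise uses a sensitivity of 1 for the neighbour count, and the Chebyshev-style constraint keeps points whose product of density and distance exceeds the mean by `k` standard deviations.

```python
import math
import random
import statistics


def cosine_distance_matrix(points):
    """Pairwise cosine distance (1 - cosine similarity) for non-zero vectors."""
    norms = [math.sqrt(sum(x * x for x in p)) for p in points]
    n = len(points)
    dist = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            dot = sum(a * b for a, b in zip(points[i], points[j]))
            dist[i][j] = dist[j][i] = max(0.0, 1.0 - dot / (norms[i] * norms[j]))
    return dist


def cutoff_by_bisection(dist, target_ratio=0.3, iters=50):
    """Bisect on the truncation distance d_c.

    Assumed criterion: the fraction of pairs closer than d_c should
    reach target_ratio. The paper's actual criterion may differ.
    """
    n = len(dist)
    pairs = [dist[i][j] for i in range(n) for j in range(i + 1, n)]
    lo, hi = 0.0, max(pairs)
    for _ in range(iters):
        mid = (lo + hi) / 2.0
        frac = sum(d < mid for d in pairs) / len(pairs)
        if frac < target_ratio:
            lo = mid
        else:
            hi = mid
    return hi


def noisy_local_density(dist, d_c, epsilon, rng, sensitivity=1.0):
    """Neighbour count within d_c plus Laplace(sensitivity / epsilon) noise."""
    rate = epsilon / sensitivity  # the Laplace scale is 1 / rate
    rho = []
    for i, row in enumerate(dist):
        count = sum(1 for j, d in enumerate(row) if j != i and d < d_c)
        # The difference of two i.i.d. exponentials is Laplace-distributed.
        noise = rng.expovariate(rate) - rng.expovariate(rate)
        rho.append(count + noise)
    return rho


def delta_distances(dist, rho):
    """Distance to the nearest point of higher density (ties broken by index)."""
    n = len(rho)
    delta = []
    for i in range(n):
        higher = [dist[i][j] for j in range(n) if (rho[j], -j) > (rho[i], -i)]
        delta.append(min(higher) if higher else max(dist[i]))
    return delta


def select_centers(rho, delta, k=2.0):
    """Keep points with gamma = rho * delta above mean + k * std (assumed rule).

    By the Chebyshev inequality at most a 1 / k**2 fraction of points
    can lie that far from the mean, so k bounds the number of centers.
    """
    gamma = [r * d for r, d in zip(rho, delta)]
    mu = statistics.mean(gamma)
    sigma = statistics.pstdev(gamma)
    return [i for i, g in enumerate(gamma) if g > mu + k * sigma]


# Toy data: two groups of directions, near the x-axis and near the y-axis.
points = [(1, 0), (1, 0.05), (1, 0.1), (1, 0.15), (1, 0.2),
          (0, 1), (0.05, 1), (0.1, 1), (0.15, 1), (0.2, 1)]
dist = cosine_distance_matrix(points)
d_c = cutoff_by_bisection(dist)
rho = noisy_local_density(dist, d_c, epsilon=1.0, rng=random.Random(0))
delta = delta_distances(dist, rho)
centers = select_centers(rho, delta, k=2.0)
print("truncation distance:", d_c)
print("candidate centers:", centers)
```

In this sketch, lowering `k` loosens the constraint and admits more candidate centers, while a smaller `epsilon` adds more noise to the densities and so trades clustering quality for stronger privacy.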
Funder
National Natural Science Foundation of China
Hubei Provincial Department of Education
Hubei University of Technology
Subject
Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science
Cited by
3 articles.