Author:
Wang Zhenxin,Liu Tao,Wang Yujie,Bao Xianglin,Xu Xiaofeng,Huang Xiangxiang,Cheng Bin
Abstract
Abstract
Clustering anonymity is a common social network data privacy protection scheme, which is based on graph-clustering. Many existing graph clustering methods mainly focus on the relationship between the structure and attributes of nodes, and the difference between them due to the metric usually causes the problem of poor clustering results. To address the shortcomings in the above graph-clustering methods, a graph-clustering anonymity method implemented with fused distance-attributes (GCA-DA) is proposed. Firstly, the algorithm quantifies the distance and attribute similarity between nodes separately and balances the metric differences between them to calculate the integrated similarity. Then all the nodes in the graph are clustered into clusters according to the integrated similarity between two nodes, each of which contains no fewer than k nodes. Finally, all the clusters are anonymized. In this method, the attribute generalization for every cluster can prevent attacks by the background knowledge of structure and attributes. In addition, the attributes are divided into numerical and non-numerical attributes to measure them separately, therefore can maintain the usability of the data better. Experiment results demonstrate the effectiveness in improving the quality of clustering and reducing information loss.
Subject
Computer Science Applications,History,Education