Clustering Large Attributed Graphs-Reference-Cited by-同舟云学术

Clustering Large Attributed Graphs

Published:2011-02 Issue:2 Volume:5 Page:1-33
ISSN:1556-4681
Container-title:ACM Transactions on Knowledge Discovery from Data
language:en
Short-container-title:ACM Trans. Knowl. Discov. Data

Author:

Cheng Hong¹,Zhou Yang¹,Yu Jeffrey Xu¹

Affiliation:

1. The Chinese University of Hong Kong

Abstract

Social networks, sensor networks, biological networks, and many other information networks can be modeled as a large graph. Graph vertices represent entities, and graph edges represent their relationships or interactions. In many large graphs, there is usually one or more attributes associated with every graph vertex to describe its properties. In many application domains, graph clustering techniques are very useful for detecting densely connected groups in a large graph as well as for understanding and visualizing a large graph. The goal of graph clustering is to partition vertices in a large graph into different clusters based on various criteria such as vertex connectivity or neighborhood similarity. Many existing graph clustering methods mainly focus on the topological structure for clustering, but largely ignore the vertex properties, which are often heterogenous. In this article, we propose a novel graph clustering algorithm, SA-Cluster , which achieves a good balance between structural and attribute similarities through a unified distance measure. Our method partitions a large graph associated with attributes into k clusters so that each cluster contains a densely connected subgraph with homogeneous attribute values. An effective method is proposed to automatically learn the degree of contributions of structural similarity and attribute similarity. Theoretical analysis is provided to show that SA-Cluster is converging quickly through iterative cluster refinement. Some optimization techniques on matrix computation are proposed to further improve the efficiency of SA-Cluster on large graphs. Extensive experimental results demonstrate the effectiveness of SA-Cluster through comparisons with the state-of-the-art graph clustering and summarization methods.

Funder

Chinese University of Hong Kong

Research Grants Council, University Grants Committee, Hong Kong

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/1921632.1921638

Reference31 articles.

1. Automatic subspace clustering of high dimensional data for data mining applications

2. Apostol T. M. 1967. Calculus Vol. 1: One-Variable Calculus with an Introduction to Linear Algebra 2nd Ed. Wiley. Apostol T. M. 1967. Calculus Vol. 1: One-Variable Calculus with an Introduction to Linear Algebra 2nd Ed. Wiley.

3. Mining hidden community in heterogeneous social networks

4. Descartes R. 1954. The Geometry of René Descartes. Dover Publications. Descartes R. 1954. The Geometry of René Descartes . Dover Publications.

Cited by 118 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An efficient graph embedding clustering approach for heterogeneous network;The Journal of Supercomputing;2024-05-28

2. Detecting communities with multiple topics in attributed networks via self-supervised adaptive graph convolutional network;Information Fusion;2024-05

3. Accurate multi-view clustering to seek the cross-viewed yet uniform sample assignment via tensor feature matching;Information Sciences;2024-04

4. Similarity enhancement of heterogeneous networks by weighted incorporation of information;Knowledge and Information Systems;2024-01-27

5. Three-Dimensional Model Resume Transfer Technology for the Entire Lifecycle of Power Grid Equipment;Smart Innovation, Systems and Technologies;2024