Motif Counting Beyond Five Nodes-Reference-Cited by-同舟云学术

Motif Counting Beyond Five Nodes

Published:2018-08-31 Issue:4 Volume:12 Page:1-25
ISSN:1556-4681
Container-title:ACM Transactions on Knowledge Discovery from Data
language:en
Short-container-title:ACM Trans. Knowl. Discov. Data

Author:

Bressan Marco¹^ORCID,Chierichetti Flavio¹,Kumar Ravi²,Leucci Stefano³,Panconesi Alessandro¹

Affiliation:

1. Sapienza University of Rome, Roma, Italy

2. Google Research, CA, USA

3. ETH Zürich, Zürich, Switzerland

Abstract

Counting graphlets is a well-studied problem in graph mining and social network analysis. Recently, several papers explored very simple and natural algorithms based on Monte Carlo sampling of Markov Chains (MC), and reported encouraging results. We show, perhaps surprisingly, that such algorithms are outperformed by color coding (CC) [2], a sophisticated algorithmic technique that we extend to the case of graphlet sampling and for which we prove strong statistical guarantees. Our computational experiments on graphs with millions of nodes show CC to be more accurate than MC; furthermore, we formally show that the mixing time of the MC approach is too high in general, even when the input graph has high conductance. All this comes at a price however. While MC is very efficient in terms of space, CC’s memory requirements become demanding when the size of the input graph and that of the graphlets grow. And yet, our experiments show that CC can push the limits of the state-of-the-art, both in terms of the size of the input graph and of that of the graphlets.

Funder

European Research Council

Sapienza Univ. Rome

Google

MIUR

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3186586

Reference32 articles.

1. Efficient Graphlet Counting for Large Networks

2. Color-coding

3. GUISE: Uniform Sampling of Graphlets for Large Graph Analysis

4. Layered label propagation

5. The webgraph framework I

Cited by 31 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Fast and Perfect Sampling of Subgraphs and Polymer Systems;ACM Transactions on Algorithms;2024-01-22

2. Efficient and Near-optimal Algorithms for Sampling Small Connected Subgraphs;ACM Transactions on Algorithms;2023-06-24

3. Efficient Biclique Counting in Large Bipartite Graphs;Proceedings of the ACM on Management of Data;2023-05-26

4. MaNIACS : Approximate Mining of Frequent Subgraph Patterns through Sampling;ACM Transactions on Intelligent Systems and Technology;2023-04-13

5. Towards Efficient Shortest Path Counting on Billion-Scale Graphs;2023 IEEE 39th International Conference on Data Engineering (ICDE);2023-04