Efficiently Estimating Motif Statistics of Large Networks-Reference-Cited by-同舟云学术

Efficiently Estimating Motif Statistics of Large Networks

Published:2014-11-17 Issue:2 Volume:9 Page:1-27
ISSN:1556-4681
Container-title:ACM Transactions on Knowledge Discovery from Data
language:en
Short-container-title:ACM Trans. Knowl. Discov. Data

Author:

Wang Pinghui¹,Lui John C. S.²,Ribeiro Bruno³,Towsley Don⁴,Zhao Junzhou⁵,Guan Xiaohong⁵

Affiliation:

1. Huawei Noah's Ark Lab, Hong Kong

2. The Chinese University of Hong Kong, Shatin, Hong Kong

3. Carnegie Mellon University, Pittsburgh, PA, USA

4. University of Massachusetts Amherst, MA, USA

5. Xi'an Jiaotong University, China

Abstract

Exploring statistics of locally connected subgraph patterns (also known as network motifs) has helped researchers better understand the structure and function of biological and Online Social Networks (OSNs). Nowadays, the massive size of some critical networks—often stored in already overloaded relational databases—effectively limits the rate at which nodes and edges can be explored, making it a challenge to accurately discover subgraph statistics. In this work, we propose sampling methods to accurately estimate subgraph statistics from as few queried nodes as possible. We present sampling algorithms that efficiently and accurately estimate subgraph properties of massive networks. Our algorithms require no precomputation or complete network topology information. At the same time, we provide theoretical guarantees of convergence. We perform experiments using widely known datasets and show that, for the same accuracy, our algorithms require an order of magnitude less queries (samples) than the current state-of-the-art algorithms.

Funder

Army Research Office

Prospective Research Project on Future Networks of Jiangsu Future Networks Innovation Institute

Application Foundation Research Program of SuZhou

Ministry of Education of the People's Republic of China

Ministry of Science and Technology of the People's Republic of China

Division of Computer and Network Systems

U.S. Army Research Laboratory

National Natural Science Foundation of China

Publisher

Association for Computing Machinery (ACM)

Subject