Sampling Graphlets of Multiplex Networks: A Restricted Random Walk Approach
-
Published:2021-06-11
Issue:4
Volume:15
Page:1-31
-
ISSN:1559-1131
-
Container-title:ACM Transactions on the Web
-
language:en
-
Short-container-title:ACM Trans. Web
Author:
Jiao Simiao1,
Xue Zihui1,
Chen Xiaowei2,
Xu Yuedong1ORCID
Affiliation:
1. School of Information Science and Engineering, Fudan University
2. Bytedance Inc.
Abstract
Graphlets are induced subgraph patterns that are crucial to the understanding of the structure and function of a large network. A lot of effort has been devoted to calculating graphlet statistics where random walk-based approaches are commonly used to access restricted graphs through the available application programming interfaces (APIs). However, most of them merely consider individual networks while overlooking the strong coupling between different networks. In this article, we estimate the graphlet concentration in
multiplex
networks with real-world applications. An inter-layer edge connects two nodes in different layers if they actually belong to the same node. The access to a multiplex network is restrictive in the sense that the upper layer allows random walk sampling, whereas the nodes of lower layers can be accessed only through the inter-layer edges and only support random node or edge sampling. To cope with this new challenge, we define a suit of two-layer graphlets and propose novel random walk sampling algorithms to estimate the proportion of all the three-node graphlets. An analytical bound on the sampling steps is proved to guarantee the convergence of our unbiased estimator. We further generalize our algorithm to explore the tradeoff between the estimated accuracy of different graphlets when the sample budget is split into different layers. Experimental evaluation on real-world and synthetic multiplex networks demonstrates the accuracy and high efficiency of our unbiased estimators.
Funder
National Key R&D Program of China
Natural Science Foundation of China
Shanghai-Hong Kong Collaborative Project
Key-Area Research and Development Program of Guangdong Province
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications
Reference53 articles.
1. N. K. Ahmed N. Duffield T. Willke and R. A. Rossi. 2017. On sampling from massive graph streams. arXiv:1703.02625. Retrieved from https://arxiv.org/abs/1703.02625. N. K. Ahmed N. Duffield T. Willke and R. A. Rossi. 2017. On sampling from massive graph streams. arXiv:1703.02625. Retrieved from https://arxiv.org/abs/1703.02625.
2. On sampling from massive graph streams;Ahmed N. K.;VLDB J.,2017
3. Network sampling: From static to streaming graphs;Ahmed N. K.;ACM Trans. Knowl. Discov. Data,2013
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Graphlets in multilayer networks;Journal of Complex Networks;2021-03-03