Affiliation:
1. University of Texas at Arlington
2. George Washington University
Abstract
In this paper, we introduce a novel, general purpose, technique for faster sampling of nodes over an online social network. Specifically, unlike traditional random walks which wait for the convergence of sampling distribution to a predetermined target distribution - a waiting process that incurs a high query cost - we develop WALK-ESTIMATE, which starts with a much shorter random walk, and then proactively estimate the sampling probability for the node taken before using acceptance-rejection sampling to adjust the sampling probability to the predetermined target distribution. We present a novel backward random walk technique which provides provably unbiased estimations for the sampling probability, and demonstrate the superiority of WALK-ESTIMATE over traditional random walks through theoretical analysis and extensive experiments over real world online social networks.
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Cited by
20 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Random Walk Sampling in Social Networks Involving Private Nodes;ACM Transactions on Knowledge Discovery from Data;2023-02-24
2. Social Graph Restoration via Random Walk Sampling;2022 IEEE 38th International Conference on Data Engineering (ICDE);2022-05
3. Sampling Multiple Nodes in Large Networks;Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining;2022-02-11
4. Common Neighbors Matter: Fast Random Walk Sampling with Common Neighbor Awareness;IEEE Transactions on Knowledge and Data Engineering;2022
5. A Low Discrepancy Sequence on Graphs;Journal of Fourier Analysis and Applications;2021-08-25