Affiliation:
1. The Chinese University of Hong Kong, Hong Kong, China
2. Renmin University of China, Beijing, China
Abstract
\em Personalized PageRank (PPR) stands as a fundamental proximity measure in graph mining. Given an input graph G with the probability of decay α, a source node s and a target node t, the PPR score π(s,t) of target t with respect to source s is the probability that an α-decay random walk starting from s stops at t. A \em single-source PPR (SSPPR) query takes an input graph G with decay probability α and a source s, and then returns the PPR π(s,v) for each node v ∈ V. Since computing an exact SSPPR query answer is prohibitive, most existing solutions turn to approximate queries with guarantees. The state-of-the-art solutions for approximate SSPPR queries are index-based and mainly focus on static graphs, while real-world graphs are usually dynamically changing. However, existing index-update schemes can not achieve a sub-linear update time. Motivated by this, we present an efficient indexing scheme for single-source PPR queries on evolving graphs. Our proposed solution is based on a classic framework that combines the forward-push technique with a random walk index for approximate PPR queries. Thus, our indexing scheme is similar to existing solutions in the sense that we store pre-sampled random walks for efficient query processing. One of our main contributions is an incremental updating scheme to maintain indexed random walks in expected O(1) time after each graph update. To achieve O(1) update cost, we need to maintain auxiliary data structures for both vertices and edges. To reduce the space consumption, we further revisit the sampling methods and propose a new sampling scheme to remove the auxiliary data structure for vertices while still supporting O(1) index update cost on evolving graphs. Extensive experiments show that our update scheme achieves orders of magnitude speed-up on update performance over existing index-based dynamic schemes without sacrificing the query efficiency.
Funder
Hong Kong RGC CRF Grant
CCF-Baidu Open Fund
Beijing Natural Science Foundation
Hong Kong ITC ITF Grant
National Natural Science Foundation of China
Hong Kong RGC GRF Grant
Hong Kong RGC ECS Grant
Publisher
Association for Computing Machinery (ACM)
Reference48 articles.
1. 2013. KONECT. http://konect.cc/networks/. 2013. KONECT. http://konect.cc/networks/.
2. 2014. SNAP Datasets. http://snap.stanford.edu/data. 2014. SNAP Datasets. http://snap.stanford.edu/data.
3. 2022. Technical Report. https://arxiv.org/abs/2212.10288. 2022. Technical Report. https://arxiv.org/abs/2212.10288.
4. 2023. Source Code. https://github.com/lalumine/firm. 2023. Source Code. https://github.com/lalumine/firm.
5. Reid Andersen Christian Borgs Jennifer T. Chayes John E. Hopcroft Vahab S. Mirrokni and Shang-Hua Teng. 2007. Local Computation of PageRank Contributions. In WAW. 150--165. Reid Andersen Christian Borgs Jennifer T. Chayes John E. Hopcroft Vahab S. Mirrokni and Shang-Hua Teng. 2007. Local Computation of PageRank Contributions. In WAW. 150--165.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Scalable Approximate Butterfly and Bi-triangle Counting for Large Bipartite Networks;Proceedings of the ACM on Management of Data;2023-12-08
2. Efficient Approximation Framework for Attribute Recommendation;Proceedings of the ACM on Management of Data;2023-12-08
3. Constrained Social Community Recommendation;Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2023-08-04