Affiliation:
1. University of Massachusetts Amherst, Amherst, MA
2. Amherst College, Amherst, MA
Abstract
“[A]llain Gersten, Hopfen, und Wasser”— 1516 ReinheitsgebotWe presentBavarian, a collection of sampling-based algorithms for approximating the Betweenness Centrality (BC) of all vertices in a graph. Our algorithms use Monte-Carlo Empirical Rademacher Averages (MCERAs), a concept from statistical learning theory, to efficiently compute tight bounds on the maximum deviation of the estimates from the exact values. The MCERAs provide a sample-dependent approximation guarantee much stronger than the state-of-the-art, thanks to its use of variance-aware probabilistic tail bounds. The flexibility of the MCERAs allows us to introduce a unifying framework that can be instantiated with existing sampling-based estimators of BC, thus allowing a fair comparison between them, decoupled from the sample-complexity results with which they were originally introduced. Additionally, we prove novel sample-complexity results showing that, for all estimators, the sample size sufficient to achieve a desired approximation guarantee depends on the vertex-diameter of the graph, an easy-to-bound characteristic quantity. We also show progressive-sampling algorithms and extensions to other centrality measures, such as percolation centrality. Our extensive experimental evaluation ofBavarianshows the improvement over the state-of-the-art made possible by the MCERAs (2–4× reduction in the error bound), and it allows us to assess the different trade-offs between sample size and accuracy guarantees offered by the different estimators.
Funder
National Science Foundation
DARPA/ARFL
Publisher
Association for Computing Machinery (ACM)
Reference69 articles.
1. Ziyad AlGhamdi, Fuad Jamour, Spiros Skiadopoulos, and Panos Kalnis. 2017. A benchmark for betweenness centrality approximation algorithms on large graphs. In Proceedings of the 29th International Conference on Scientific and Statistical Database Management. ACM, 6:1--6:12.
2. Josh Alman and Virginia Vassilevska Williams. 2021. A refined laser method and faster matrix multiplication. In Proceedings of the 2021 ACM-SIAM Symposium on Discrete Algorithms. SIAM, 522–539.
3. Jac. M. Anthonisse. 1971. The Rush in a Directed Graph. Technical Report BN 9/71. Stichting Mathematisch Centrum, Amsterdam, Netherlands.
4. Proceedings of the Algorithms and Models for the Web-Graph;Bader David A.,2007
5. Rademacher and Gaussian complexities: Risk bounds and structural results;Bartlett Peter L.;Journal of Machine Learning Research,2002
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献