Affiliation:
1. Zhejiang Lab & Huazhong University of Science and Technology, China
2. Huazhong University of Science and Technology, China
3. University of Warwick, UK
4. National University of Singapore, Singapore
5. University of Southern Queensland, Australia
6. ANT Group, China
Abstract
With the increasing need for graph analysis, massive
Concurrent iterative Graph Processing
(CGP) jobs are usually performed on the common large-scale real-world graph. Although several solutions have been proposed, these CGP jobs are not coordinated with the consideration of the inherent dependencies in graph data driven by graph topology. As a result, they suffer from redundant and fragmented accesses of the same underlying graph dispersed over distributed platform, because the same graph is typically irregularly traversed by these jobs along different paths at the same time.
In this work, we develop
GraphTune
, which can be integrated into existing distributed graph processing systems, such as D-Galois, Gemini, PowerGraph, and Chaos, to efficiently perform CGP jobs and enhance system throughput. The key component of GraphTune is a dependency-aware synchronous execution engine in conjunction with several optimization strategies based on the constructed cross-iteration dependency graph of chunks. Specifically, GraphTune transparently regularizes the processing behavior of the CGP jobs in a novel synchronous way and assigns the chunks of graph data to be handled by them based on the topological order of the dependency graph so as to maximize the performance. In this way, it can transform the irregular accesses of the chunks into more regular ones so that as many CGP jobs as possible can fully share the data accesses to the common graph. Meanwhile, it also efficiently synchronizes the communications launched by different CGP jobs based on the dependency graph to minimize the communication cost. We integrate it into four cutting-edge distributed graph processing systems and a popular out-of-core graph processing system to demonstrate the efficiency of GraphTune. Experimental results show that GraphTune improves the throughput of CGP jobs by 3.1∼6.2, 3.8∼8.5, 3.5∼10.8, 4.3∼12.4, and 3.8∼6.9 times over D-Galois, Gemini, PowerGraph, Chaos, and GraphChi, respectively.
Funder
National Key Research and Development Program of China
NSFC
Major Scientific Research Project of Zhejiang Lab
CCF-AFSG Research Fund
Young Top-notch Talent Cultivation Program of Hubei Province
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture,Information Systems,Software
Reference63 articles.
1. Facebook. 2023. Retrieved 21 March 2023 from http://www.facebook.com/.
2. Google. 2023. Retrieved 21 March 2023 from http://www.google.com/.
3. Graph500. 2023. Retrieved 21 March 2023 from http://graph500.org/?page_id=12#sec-6_1.com/.
4. LAW. 2023. Retrieved 21 March 2023 from http://law.di.unimi.it/datasets.php.
5. SNAP. 2023. Retrieved 21 March 2023 from http://snap.stanford.edu/data/index.html.