Affiliation:
1. National Engineering Research Center for Big Data Technology and System, Service Computing Technology and System Lab, Cluster and Grid Computing Lab, School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China
2. National University of Singapore, Singapore
Abstract
Many out-of-GPU-memory systems have recently been designed to support iterative processing of large-scale graphs. However, these systems still suffer from slow convergence because active vertices' new states propagate inefficiently along graph paths. To support out-of-GPU-memory graph processing efficiently, this work designs a system, LargeGraph. Unlike existing out-of-GPU-memory systems, LargeGraph proposes a dependency-aware data-driven execution approach, which significantly accelerates the propagation of active vertices' states along graph paths with low data-access cost and high parallelism. Specifically, according to the dependencies between vertices, it loads and processes only the graph data associated with dependency chains originating from active vertices, reducing access cost. Because of the power-law property, most active vertices propagate their new states along a small, evolving set of paths; this small path set is dynamically identified, maintained, and handled efficiently on the GPU to accelerate most propagations for faster convergence, while the remaining graph data are handled on the CPU. For out-of-GPU-memory graph processing, LargeGraph outperforms four cutting-edge systems: Totem (5.19–11.62×), Graphie (3.02–9.41×), Garaph (2.75–8.36×), and Subway (2.45–4.15×).
Funder
National Natural Science Foundation of China
Zhejiang Lab
Fundamental Research Funds for the Central Universities
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture, Information Systems, Software
References: 55 articles.
1. Stanford. 2020. Stanford Large Network Dataset Collection. http://snap.stanford.edu/data/index.html.
Cited by 2 articles.
1. A Bucket-aware Asynchronous Single-Source Shortest Path Algorithm on GPU;Proceedings of the 52nd International Conference on Parallel Processing Workshops;2023-08-07
2. Triangle Dropping: An Occluded-geometry Predictor for Energy-efficient Mobile GPUs;ACM Transactions on Architecture and Code Optimization;2022-05-25