Accelerating Graph Computations on 3D NoC-Enabled PIM Architectures-Reference-Cited by-同舟云学术

Accelerating Graph Computations on 3D NoC-Enabled PIM Architectures

Published:2023-03-19 Issue:3 Volume:28 Page:1-16
ISSN:1084-4309
Container-title:ACM Transactions on Design Automation of Electronic Systems
language:en
Short-container-title:ACM Trans. Des. Autom. Electron. Syst.

Author:

Choudhury Dwaipayan¹^ORCID,Xiang Lizhi¹^ORCID,Rajam Aravind¹^ORCID,Kalyanaraman Anantharaman¹^ORCID,Pande Partha Pratim¹^ORCID

Affiliation:

1. Washington State University, Pullman, WA

Abstract

Graph application workloads are dominated by random memory accesses with the poor locality. To tackle the irregular and sparse nature of computation, ReRAM-based Processing-in-Memory (PIM) architectures have been proposed recently. Most of these ReRAM architecture designs have focused on mapping graph computations into a set of multiply-and-accumulate (MAC) operations. ReRAMs also offer a key advantage in reducing memory latency between cores and memory by allowing for PIM. However, when implemented on a ReRAM-based manycore architecture, graph applications still pose two key challenges—significant storage requirements (particularly due to wasted zero cell storage), and significant amount of on-chip traffic. To tackle these two challenges, in this article, we propose the design of a 3D NoC-enabled ReRAM-based manycore architecture. Our proposed architecture incorporates a novel crossbar-aware node reordering to reduce ReRAM storage requirements. Secondly, its 3D NoC-enabled design reduces on-chip communication latency. Our architecture outperforms the state-of-the-art in ReRAM-based graph acceleration by up to 5× in performance while consuming up to 10.3× less energy for a range of graph inputs and workloads.

Funder

US National Science Foundation

Publisher

Association for Computing Machinery (ACM)

Subject

Electrical and Electronic Engineering,Computer Graphics and Computer-Aided Design,Computer Science Applications

Link

https://dl.acm.org/doi/pdf/10.1145/3564290

Reference25 articles.

1. K. A. Kalyanaraman and P. Pande. 2019. A brief survey of algorithms, architectures, and challenges toward extreme-scale graph analytics. In Proceedings of the Design, Automation & Test in Europe Conference & Exhibition. 1307–1312.

2. L. Zheng, J. Zhao, Y. Huang, Q. Wang, Z. Zeng, J. Xue, X. Liao, and H. Jin. 2020. Spara: An energy-efficient ReRAM-Based accelerator for sparse graph analytics applications. In Proceedings of the IEEE International Parallel and Distributed Processing Symposium (IPDPS). 696–707.

3. G. Dai, T. Huang, Y. Wang, H. Yang, and J. Wawrzynek. 2019. GraphSAR: A sparsity-aware processing-in-memory architecture for large-scale graph processing on ReRAMs. In Proceedings of the Asia and South Pacific Design Automation Conference. 120–126.

4. A. A. Maashri, G. Sun, X. Dong, V. Narayanan, and Y. Xie. 2009. 3D GPU architecture using cache stacking: Performance, cost, power and thermal analysis. In Proceedings of the IEEE International Conference on Computer Design. 254–259.

5. Energy Efficient Architecture for Graph Analytics Accelerators

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. TEFLON: Thermally Efficient Dataflow-aware 3D NoC for Accelerating CNN Inferencing on Manycore PIM Architectures;ACM Transactions on Embedded Computing Systems;2024-08-14

2. Load Balanced PIM-Based Graph Processing;ACM Transactions on Design Automation of Electronic Systems;2024-06-21

3. PhGraph: A High-Performance ReRAM-Based Accelerator for Hypergraph Applications;IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems;2023