Abstract
Graph analytics shows promise for solving challenging problems on relational data. However, the large size of graphs and the high complexity of graph algorithms impose severe memory constraints. Data prefetching is a crucial technique for hiding memory access latency by predicting data and fetching it into the cache ahead of use. Traditional prefetchers rely on fixed rules and struggle to adapt to the complex memory access patterns of graph analytics. Machine learning (ML) algorithms, particularly long short-term memory (LSTM) models, excel at memory access prediction, but they face challenges such as difficulty in learning interleaved access patterns and high storage cost when predicting over a large memory address space. In addition, there remains a gap between designing a high-performance ML-based memory access predictor and deploying an effective ML-based prefetcher in an existing memory system. In this work, we propose a novel Attention-based prefetching framework to accelerate graph analytics applications. To achieve high-performance memory access prediction, we propose A2P, a novel Attention-based memory Access Predictor for graph analytics. We use the multi-head self-attention mechanism to extract features from memory traces. We design a novel bitmap labeling method that collects future deltas within a spatial range, making interleaved patterns easier to learn. We introduce a novel super page concept, allowing the model to predict beyond physical page constraints. To integrate A2P into a memory system, we design a three-module prefetching framework composed of an existing memory hierarchy, a prefetch controller, and the predictor A2P. We further propose a hybrid design that combines A2P with existing hardware prefetchers for higher prefetching performance. We evaluate A2P and the prefetching framework on the widely used GAP benchmark suite. Prediction experiments show that, for the top three predictions, A2P outperforms the state-of-the-art LSTM-based model by 23.1% in Precision, 21.2% in Recall, and 10.4% in Coverage. Prefetching experiments show that A2P provides 18.4% IPC improvement on average, outperforming the state-of-the-art prefetchers BO by 17.2%, ISB by 15.0%, and Delta-LSTM by 10.9%. A hybrid prefetcher combining A2P and ISB achieves 21.7% IPC improvement, outperforming the hybrid of BO and ISB by 16.3%.
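To make the bitmap labeling idea concrete, below is a minimal Python sketch (not the authors' implementation; the function name, lookahead window, and delta range are illustrative assumptions). For each memory access, the label is a bitmap marking which deltas within a spatial range appear among the next few accesses, turning interleaved streams into a multi-label prediction target rather than a single next-delta.

```python
# Sketch of bitmap labeling for memory-access deltas, assuming block
# addresses as input. Parameters `lookahead` and `delta_range` are
# hypothetical choices, not values from the paper.

def bitmap_labels(block_addrs, lookahead=16, delta_range=64):
    """For each access, build a bitmap over deltas in
    [-delta_range, +delta_range] (excluding 0) that occur within the
    next `lookahead` accesses."""
    labels = []
    for i, base in enumerate(block_addrs):
        bitmap = 0
        for future in block_addrs[i + 1 : i + 1 + lookahead]:
            delta = future - base
            if delta != 0 and -delta_range <= delta <= delta_range:
                # Map delta in [-range, range] \ {0} to a bit index
                # in [0, 2 * delta_range).
                bit = delta + delta_range if delta < 0 else delta + delta_range - 1
                bitmap |= 1 << bit
        labels.append(bitmap)
    return labels

# Example: two interleaved sequential streams of block addresses.
# Each access's label captures both streams' future deltas at once.
trace = [100, 200, 101, 201, 102, 202, 103, 203]
print([bin(b) for b in bitmap_labels(trace, lookahead=4, delta_range=4)])
```

Framing the label as a set of nearby future deltas is what lets a single prediction cover several interleaved access streams; a top-k decoding of the predicted bitmap then yields multiple prefetch candidates per access.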
Funder
National Science Foundation
University of Southern California
Publisher
Springer Science and Business Media LLC
Cited by
1 article.