A Survey of Accelerating Parallel Sparse Linear Algebra-Reference-Cited by-同舟云学术

A Survey of Accelerating Parallel Sparse Linear Algebra

Published:2023-08-28 Issue:1 Volume:56 Page:1-38
ISSN:0360-0300
Container-title:ACM Computing Surveys
language:en
Short-container-title:ACM Comput. Surv.

Author:

Xiao Guoqing¹^ORCID,Yin Chuanghui²^ORCID,Zhou Tao²^ORCID,Li Xueqi²^ORCID,Chen Yuedan³^ORCID,Li Kenli²^ORCID

Affiliation:

1. College of Computer Science and Electronic Engineering, Hunan University, China and Shenzhen Research Institute, Hunan University, China

2. College of Computer Science and Electronic Engineering, Hunan University, China

3. Big Data Institute, Central South University, China

Abstract

Sparse linear algebra includes the fundamental and important operations in various large-scale scientific computing and real-world applications. There exists performance bottleneck for sparse linear algebra since it mainly contains the memory-bound computations with low arithmetic intensity. How to improve its performance has increasingly become a focus of research efforts. Using parallel computing techniques to accelerate sparse linear algebra is currently the most popular method, while facing various challenges, e.g., large-scale data brings difficulties in storage, and the sparsity of data leads to irregular memory accesses and parallel load imbalance. Therefore, this article provides a comprehensive overview on acceleration of sparse linear algebra operations using parallel computing platforms, where we focus on four main classifications: sparse matrix-vector multiplication (SpMV), sparse matrix-sparse vector multiplication (SpMSpV), sparse general matrix-matrix multiplication (SpGEMM), and sparse tensor algebra. The takeaways from this article include the following: understanding the challenges of accelerating linear sparse algebra on various hardware platforms; understanding how structured data sparsity can improve storage efficiency; understanding how to optimize parallel load balance; understanding how to improve the efficiency of memory accesses; understanding how do the adaptive frameworks automatically select the optimal algorithms; and understanding recent design trends for acceleration of parallel sparse linear algebra.

Funder

Key-Area R&D Program of Guangdong Province

Programs of the National Natural Science Foundation of China

Outstanding Youth Funding of CCF-HUAWEI

Programs of Hunan Province

Program of the Natural Science Foundation of Guangdong Province

Program of Fundamental Research of Shenzhen

Open Research Project of Zhejiang Lab

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science,Theoretical Computer Science