Data-driven Mixed Precision Sparse Matrix Vector Multiplication for GPUs-Reference-Cited by-同舟云学术

Data-driven Mixed Precision Sparse Matrix Vector Multiplication for GPUs

Published:2020-01-10 Issue:4 Volume:16 Page:1-24
ISSN:1544-3566
Container-title:ACM Transactions on Architecture and Code Optimization
language:en
Short-container-title:ACM Trans. Archit. Code Optim.

Author:

Ahmad Khalid¹,Sundar Hari¹,Hall Mary¹

Affiliation:

1. University of Utah, Salt Lake City, UT

Abstract

We optimize Sparse Matrix Vector multiplication (SpMV) using a mixed precision strategy (MpSpMV) for Nvidia V100 GPUs. The approach has three benefits: (1) It reduces computation time, (2) it reduces the size of the input matrix and therefore reduces data movement, and (3) it provides an opportunity for increased parallelism. MpSpMV’s decision to lower to single precision is data driven , based on individual nonzero values of the sparse matrix. On all real-valued matrices from the Sparse Matrix Collection, we obtain a maximum speedup of 2.61× and average speedup of 1.06× over double precision, while maintaining higher accuracy compared to single precision.

Funder

NSF

U.S. Government

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture,Information Systems,Software

Link

https://dl.acm.org/doi/pdf/10.1145/3371275

Reference35 articles.

1. A High Performance Block Eigensolver for Nuclear Configuration Interaction Calculations

Cited by 18 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Block-wise dynamic mixed-precision for sparse matrix-vector multiplication on GPUs;The Journal of Supercomputing;2024-03-11

2. Double Precision is not Necessary for LSQR for Solving Discrete Linear Ill-Posed Problems;Journal of Scientific Computing;2024-02-01

3. Adaptive Precision Sparse Matrix–Vector Product and Its Application to Krylov Solvers;SIAM Journal on Scientific Computing;2024-01-25

4. Reduced-Precision and Reduced-Exponent Formats for Accelerating Adaptive Precision Sparse Matrix–Vector Product;Lecture Notes in Computer Science;2024

5. Adaptive Lossy Data Compression Extended Architecture for Memory Bandwidth Conservation in SpMV;IEICE Transactions on Information and Systems;2023-12-01