Multiple-precision matrix-vector multiplication on graphics processing units-Reference-Cited by-同舟云学术

Multiple-precision matrix-vector multiplication on graphics processing units

Published:2020-08-20 Issue:3 Volume:11 Page:61-84
ISSN:2079-3316
Container-title:Program Systems: Theory and Applications
language:ru
Short-container-title:ПСТП

Author:

Isupov Konstantin¹^ORCID,Knyazkov Vladimir²^ORCID

Affiliation:

1. Vyatka State University

2. Penza State University

Abstract

We are considering a parallel implementation of matrix-vector multiplication (GEMV, Level 2 of the BLAS) for graphics processing units (GPUs) using multiple-precision arithmetic based on the residue number system. In our GEMV implementation, element-wise operations with multiple-precision vectors and matrices consist of several parts, each of which is calculated by a separate CUDA kernel. This feature eliminates branch divergence when performing sequential parts of multiple-precision operations and allows the full utilization of the GPU’s resources. An efficient data structure for storing arrays with multiple-precision entries provides a coalesced access pattern to the GPU global memory. We have performed a rounding error analysis and derived error bounds for the proposed GEMV implementation. Experimental results show the high efficiency of the proposed solution compared to existing high-precision packages deployed on GPU.

Publisher

Ailamazyan Program Systems Institute of Russian Academy of Sciences (PSI RAS)

Subject

General Computer Science

Reference29 articles.

1. M. Courbariaux, Y. Bengio, J. David. Training deep neural networks with low precision multiplications, 2014.

2. High-Precision Arithmetic in Mathematical Physics

3. Numerical aspects of integration in semi-closed option pricing formulas for stochastic volatility jump diffusion models

4. The PSLQ algorithm for empirical data

5. Fast arbitrary order moments and arbitrary precision solution of the general rate model of column liquid chromatography with linear isotherm

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Active Traffic Signal Decisions Using Vector‐Matrix Multiplication;Advanced Intelligent Systems;2023-01-18