Affiliation:
1. Depto. de Ingeniería y Ciencia de Computadores, Universitat Jaume I, Castellón de la Plana, Spain
2. Steinbuch Centre for Computing, Karlsruhe Institute of Technology, Karlsruhe, Germany
3. Innovative Computing Lab, University of Tennessee, Knoxville (Tennessee), USA
4. Depto. de Informática de Sistemas y Computadores, Universitat Politècnica de València, Valencia, Spain
5. Depto. de Informática, Universitat de València, Valencia, Spain
Abstract
Many practical algorithms for numerical rank computations implement an iterative procedure that involves repeated multiplications of a vector, or a collection of vectors, with both a sparse matrix and its transpose. Unfortunately, the realizations of these sparse products in current high performance libraries often deliver much lower arithmetic throughput when the matrix involved in the product is transposed. In this work, we propose a hybrid sparse matrix layout, named CSRC, that combines the flexibility of some well-known sparse formats to offer a number of appealing properties: (1) CSRC can be obtained at low cost from the popular CSR (compressed sparse row) format; (2) CSRC has storage requirements similar to those of CSR; and, especially, (3) the implementation of the sparse product kernels delivers high performance for both the direct product and its transposed variant on modern graphics accelerators, thanks to a significant reduction of atomic operations compared with a conventional implementation based on CSR. This solution thus yields considerably higher performance when integrated into an iterative algorithm for the truncated singular value decomposition (SVD), such as the randomized SVD or, as demonstrated in the experimental results, the block Golub–Kahan–Lanczos algorithm.
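To illustrate why the transposed product is the harder case for a conventional CSR implementation, the following is a minimal sequential sketch in Python. It is not the CSRC layout proposed in the paper, whose details are not given in this abstract; the function names `csr_matvec` and `csr_matvec_transposed` and the small test matrix are assumptions made for illustration only. In the direct product each row accumulates into its own output entry, whereas in the transposed product several rows may update the same output entry, which is where a GPU kernel that assigns rows to threads would typically need atomic additions.

```python
def csr_matvec(n_rows, row_ptr, col_idx, vals, x):
    """Compute y = A @ x for A stored in CSR (hypothetical helper).

    Each row i gathers values of x and writes only y[i], so rows can be
    processed independently without write conflicts.
    """
    y = [0.0] * n_rows
    for i in range(n_rows):
        acc = 0.0
        for k in range(row_ptr[i], row_ptr[i + 1]):
            acc += vals[k] * x[col_idx[k]]
        y[i] = acc
    return y


def csr_matvec_transposed(n_rows, n_cols, row_ptr, col_idx, vals, x):
    """Compute y = A^T @ x using the same CSR arrays (hypothetical helper).

    Row i scatters its contributions into y[col_idx[k]]. Different rows can
    target the same entry of y, so a row-parallel kernel would have to
    protect this update (for example with an atomic add).
    """
    y = [0.0] * n_cols
    for i in range(n_rows):
        for k in range(row_ptr[i], row_ptr[i + 1]):
            y[col_idx[k]] += vals[k] * x[i]  # conflicting write in parallel
    return y


# Illustrative 2x3 matrix A = [[1, 0, 2],
#                              [0, 3, 0]]
row_ptr = [0, 2, 3]
col_idx = [0, 2, 1]
vals = [1.0, 2.0, 3.0]

y_direct = csr_matvec(2, row_ptr, col_idx, vals, [1.0, 1.0, 1.0])
y_transposed = csr_matvec_transposed(2, 3, row_ptr, col_idx, vals, [1.0, 2.0])
print(y_direct)      # [3.0, 3.0]
print(y_transposed)  # [1.0, 6.0, 2.0]
```

The scatter pattern in the second function is the source of the atomic operations mentioned in the abstract; the sketch only shows the access pattern and makes no claim about how CSRC reduces those conflicts.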
Funder
Office of Science
Spanish National Plan for Scientific and Technical Research and Innovation
Universitat Jaume I
Subject
Computational Theory and Mathematics, Computer Networks and Communications, Computer Science Applications, Theoretical Computer Science, Software
Cited by
1 article.
1. Special Issue on the pervasive nature of HPC (PN‐HPC);Concurrency and Computation: Practice and Experience;2024-01-14