Abstract
AbstractGraphics processing unit accelerators have become a widespread technology on modern high performance computing clusters for increasing the performance of scientific computing algorithms. Despite early efforts to adopt linear solvers that utilize graphics processing units for OpenFOAM, to this date no widely accepted approach has gotten traction. In recent years, the number of different vendors providing graphics processing units accelerators has grown, and as of the writing of this paper, no commonly accepted, unified approach to leverage accelerators exists. This makes platform-portable solutions to increase the efficiency of graphics processing units offloading techniques desirable, and an important research topic. In this work, we investigate a platform-portable solution using the Ginkgo sparse linear algebra library.
Funder
German Federal Ministry of Education and Research
Karlsruher Institut für Technologie (KIT)
Publisher
Springer Science and Business Media LLC
Reference27 articles.
1. NVIDIA: CUDA C++ programming guide (2023)
2. AMD: ROCm-developer-tools/HIP. ROCm Developer Tools (2023)
3. Khronos: SYCL—C++ single-source heterogeneous programming for acceleration offload (2014)
4. Dagum L, Menon R (1998) Openmp: an industry standard API for shared-memory programming. IEEE Comput Sci Eng 5(1):46–55
5. Trott CR, Lebrun-Grandié D, Arndt D, Ciesko J, Dang V, Ellingwood N, Gayatri R, Harvey E, Hollman DS, Ibanez D, Liber N, Madsen J, Miles J, Poliakoff D, Powell A, Rajamanickam S, Simberg M, Sunderland D, Turcksin B, Wilke J (2022) Kokkos 3: programming model extensions for the exascale era. IEEE Trans Parallel Distrib Syst 33(4):805–817. https://doi.org/10.1109/TPDS.2021.3097283