Affiliation:
1. School of Electrical Engineering, University of Belgrade, Bulevar Kralja Aleksandra 73, 11000 Belgrade, Serbia
Abstract
Heterogeneous architectures consisting of both central processing units and graphics processing units are common in contemporary computer systems. For that reason, several programming models have been developed to exploit available parallelism, such as low-level CUDA and OpenCL, and directive-based OpenMP and OpenACC. In this paper we explore and evaluate the applicability of OpenACC, which is a directive-based programming model for GPUs. We focus both on the performance and programming effort needed to parallelize the existing sequential algorithms for GPU execution. The evaluation is based on the benchmark suite Parboil, which consists of 11 different mini-applications from different scientific domains, both compute- and memory-bound. The results show that mini-apps parallelized with OpenACC can achieve significant speedups over sequential implementations and in some cases, even outperform CUDA implementations. Furthermore, there is less of a programming effort compared to low-level models, such as CUDA and OpenCL, because a majority of the work is left to the compiler and overall, the code needs less restructuring.
Funder
Ministry of Science, Technological Development and Innovation of the Republic of Serbia
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference48 articles.
1. Mišić, M.J., Đurđević, Đ.M., and Tomašević, M.V. (2012, January 21–25). Evolution and trends in GPU computing. Proceedings of the 2012 35th International Convention MIPRO, Opatija, Croatia.
2. A survey on parallel computing and its applications in data-parallel problems using GPU architectures;Navarro;Commun. Comput. Phys.,2014
3. A survey of GPU-based acceleration techniques in MRI reconstructions;Wang;Quant. Imaging Med. Surg.,2018
4. A survey of graph processing on graphics processing units;Tran;J. Supercomput.,2018
5. Linear solvers for power grid optimization problems: A review of GPU-accelerated linear solvers;Darve;Parallel Comput.,2022
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Towards a GPU-Parallelization of the neXtSIM-DG Dynamical Core;Proceedings of the Platform for Advanced Scientific Computing Conference;2024-06-03