Author:
Herrmann Nina, Dieckmann Justus, Kuchen Herbert
Abstract
Complex algorithms and enormous data sets require parallel execution of programs to attain results in a reasonable amount of time. Both aspects are combined in the domain of three-dimensional stencil operations, for example, in computational fluid dynamics. This work contributes to the research on high-level parallel programming by discussing the generalizable implementation of a three-dimensional stencil skeleton that works in heterogeneous computing environments. Two exemplary programs, a gas simulation with the Lattice Boltzmann method and a mean blur, are executed in a multi-node, multi-GPU environment, demonstrating runtime improvements in heterogeneous computing environments compared to a sequential program.
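To make the notion of a three-dimensional stencil operation concrete, the following is a minimal sequential sketch of a mean blur in Python. It is not the skeleton or API described in the paper. The function name `mean_blur_3d`, the `radius` parameter, and the border handling (averaging only over neighbours that lie inside the grid) are assumptions chosen for illustration.

```python
def mean_blur_3d(grid, radius=1):
    """Return a new 3D grid in which every cell holds the mean of its
    neighbourhood, a cube of side 2 * radius + 1 centred on the cell.

    Assumption: cells outside the grid are ignored, so border cells are
    averaged over fewer neighbours.
    """
    nx, ny, nz = len(grid), len(grid[0]), len(grid[0][0])
    # Write into a separate output grid so every read sees the old values.
    out = [[[0.0] * nz for _ in range(ny)] for _ in range(nx)]
    for x in range(nx):
        for y in range(ny):
            for z in range(nz):
                total = 0.0
                count = 0
                # Visit the in-bounds part of the cubic neighbourhood.
                for i in range(max(0, x - radius), min(nx, x + radius + 1)):
                    for j in range(max(0, y - radius), min(ny, y + radius + 1)):
                        for k in range(max(0, z - radius), min(nz, z + radius + 1)):
                            total += grid[i][j][k]
                            count += 1
                out[x][y][z] = total / count
    return out
```

Each output cell depends only on the input grid, so all cells can be computed independently. That independence is what makes stencil operations a natural fit for distribution across nodes and GPUs, at the cost of exchanging border regions between partitions.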
Publisher
Springer Science and Business Media LLC