Author:
Brodtkorb André R.,Sætra Martin L.
Abstract
GPUs have become a household name in High Performance Computing (HPC) systems over the last 15 years. However, programming GPUs is still largely a manual and arduous task, which requires expert knowledge of the physics, mathematics, and computer science involved. Even though there have been large advances in automatic parallelization and GPU execution of serial code, it is still difficult to fully utilize the GPU hardware with such approaches. Many core numeric GPU codes are therefore still mostly written using low level C/C++ or Fortran for the host code. Several studies have shown that using higher level languages, such as Python, can make software development faster and with fewer bugs. We have developed a simulator based on PyCUDA and mpi4py in Python for solving the Euler equations on Cartesian grids. Our framework utilizes the GPU, and can automatically run on clusters using MPI as well as on shared-memory systems. Our framework allows the programmer to implement low-level details in CUDA C/C++, which is important to achieve peak performance, whilst still benefiting from the productivity of Python. We show that our framework achieves good weak and strong scaling. Our weak scaling achieves more than 94% efficiency on a shared-memory GPU system and more than 90% efficiency on a distributed-memory GPU system, and our strong scaling is close to perfect on both shared-memory and distributed-memory GPU systems.
Subject
Physical and Theoretical Chemistry,General Physics and Astronomy,Mathematical Physics,Materials Science (miscellaneous),Biophysics
Reference29 articles.
1. Fast matrix multiplies using graphics hardware;Larsen,2001
2. A survey of general-purpose computation on graphics hardware;Owens;Computer Graphics Forum,2007
3. GPU computing;Owens;Proc IEEE,2008
4. State-of-the-art in heterogeneous computing;Brodtkorb;Scientific Programming,2010
5. Graphics processing unit (GPU) programming strategies and trends in GPU computing;Brodtkorb;J Parallel Distributed Comput,2013