Abstract
Abstract. With semiconductor technology gradually approaching its physical
and thermal limits, graphics processing units (GPUs) are becoming an
attractive solution for many scientific applications due to their high
performance. This paper presents an application of GPU accelerators in an
air quality model. We demonstrate an approach that runs a piecewise parabolic method (PPM) solver of
horizontal advection (HADVPPM) for the air quality model CAMx on GPU
clusters. Specifically, we first convert the HADVPPM to a new Compute
Unified Device Architecture C (CUDA C) code to make it computable on the GPU
(GPU-HADVPPM). Then, a series of optimization measures are taken, including
reducing the CPU–GPU communication frequency, increasing the data size
computation on the GPU, optimizing the GPU memory access, and using thread
and block indices to improve the overall computing performance of the CAMx
model coupled with GPU-HADVPPM (named the CAMx-CUDA model). Finally, a
heterogeneous, hybrid programming paradigm is presented and utilized with
GPU-HADVPPM on the GPU clusters with a message passing interface (MPI)
and CUDA. The offline experimental results show that running GPU-HADVPPM on
one NVIDIA Tesla K40m and an NVIDIA Tesla V100 GPU can achieve up to a
845.4× and 1113.6× acceleration. By implementing a series of optimization
schemes, the CAMx-CUDA model results in a 29.0× and 128.4× improvement in
computational efficiency by using a GPU accelerator card on a K40m and V100
cluster, respectively. In terms of the single-module computational
efficiency of GPU-HADVPPM, it can achieve 1.3× and 18.8× speedup on an
NVIDIA Tesla K40m GPU and NVIDIA Tesla V100 GPU, respectively. The multi-GPU
acceleration algorithm enables a 4.5× speedup with eight CPU cores and eight GPU accelerators on a V100 cluster.
Funder
National Key Research and Development Program of China
Reference31 articles.
1. Bleichrodt, F., Bisseling, R. H., and Dijkstra, H. A.: Accelerating a
barotropic ocean model using a GPU, Ocean Model., 41, 16–21,
https://doi.org/10.1016/j.ocemod.2011.10.001, 2012.
2. Cao, K., Wu, Q., Wang, L., Wang, N., Cheng, H., Tang, X., Li, D., and Wang,
L.: The dataset of the manuscript “GPU-HADVPPM V1.0: high-efficient parallel
GPU design of the Piecewise Parabolic Method (PPM) for horizontal advection
in air quality model (CAMx V6.10)”, Zenodo [data set],
https://doi.org/10.5281/zenodo.7765218, 2023.
3. Colella, P. and Woodward, P. R.: The Piecewise Parabolic Method (PPM) for
gas-dynamical simulations, J. Comput. Phys., 54, 174–201,
https://doi.org/10.1016/0021-9991(84)90143-8, 1984.
4. ENVIRON: User Guide for Comprehensive Air Quality Model with Extensions
Version 6.1, https://camx-wp.azurewebsites.net/Files/CAMxUsersGuide_v6.10.pdf (last access: 19 December 2022), 2014.
5. ENVIRON: CAMx version 6.1, ENVIRON [code], available at: https://camx-wp.azurewebsites.net/download/source/, last access: 24 March 2023.
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献