Abstract
Abstract. This paper presents an application of GPU accelerators in Earth system modeling. We focus on atmospheric chemical kinetics, one of the most computationally intensive tasks in climate–chemistry model simulations. We developed a software package that automatically generates CUDA kernels to numerically integrate atmospheric chemical kinetics in the global climate model ECHAM/MESSy Atmospheric Chemistry (EMAC), used to study climate change and air quality scenarios. A source-to-source compiler outputs a CUDA-compatible kernel by parsing the FORTRAN code generated by the Kinetic PreProcessor (KPP) general analysis tool. All Rosenbrock methods that are available in the KPP numerical library are supported.Performance evaluation, using Fermi and Pascal CUDA-enabled GPU accelerators, shows achieved speed-ups of 4. 5 × and 20. 4 × , respectively, of the kernel execution time. A node-to-node real-world production performance comparison shows a 1. 75 × speed-up over the non-accelerated application using the KPP three-stage Rosenbrock solver. We provide a detailed description of the code optimizations used to improve the performance including memory optimizations, control code simplification, and reduction of idle time. The accuracy and correctness of the accelerated implementation are evaluated by comparing to the CPU-only code of the application. The median relative difference is found to be less than 0.000000001 % when comparing the output of the accelerated kernel the CPU-only code.The approach followed, including the computational workload division, and the developed GPU solver code can potentially be used as the basis for hardware acceleration of numerous geoscientific models that rely on KPP for atmospheric chemical kinetics applications.
Reference27 articles.
1. Alvanos, M., and Christoudias, T.: MEDINA: MECCA development in accelerators – KPP Fortran to CUDA source-to-source Pre-processor, J. Open Res. Softw., 5, https://doi.org/10.5334/jors.158, 2017a.
2. Alvanos, M., and Christoudias, T.: MECCA – KPP Fortran to CUDA source-to-source pre-processor, available at: https://doi.org/10.5281/zenodo.546811, 2017b.
3. Christou, M., Christoudias, T., Morillo, J., Alvarez, D., and Merx, H.: Earth system modelling on system-level heterogeneous architectures: EMAC (version 2.42) on the Dynamical Exascale Entry Platform (DEEP), Geosci. Model Dev., 9, 3483–3491, https://doi.org/10.5194/gmd-9-3483-2016, 2016.
4. Christoudias, T., and Alvanos, M.: Accelerated chemical kinetics in the EMAC chemistry-climate model, in: High Performance Computing &Simulation (HPCS), 2016 International Conference on, IEEE, Innsbruck, Austria, 886–889, https://doi.org/10.1109/HPCSim.2016.7568427, 2016.
5. Corden, M. J., and Kreitzer, D.: Consistency of Floating-Point Results using the Intel® Compiler, Intel Corporation, 2012.
Cited by
12 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献