Massive-Parallel Trajectory Calculations version 2.2 (MPTRAC-2.2): Lagrangian transport simulations on graphics processing units (GPUs)
-
Published:2022-04-05
Issue:7
Volume:15
Page:2731-2762
-
ISSN:1991-9603
-
Container-title:Geoscientific Model Development
-
language:en
-
Short-container-title:Geosci. Model Dev.
Author:
Hoffmann LarsORCID, Baumeister Paul F.ORCID, Cai Zhongyin, Clemens JanORCID, Griessbach SabineORCID, Günther GebhardORCID, Heng Yi, Liu MingzhaoORCID, Haghighi Mood KavehORCID, Stein Olaf, Thomas Nicole, Vogel BärbelORCID, Wu XueORCID, Zou LingORCID
Abstract
Abstract. Lagrangian models are fundamental tools to study atmospheric transport processes and for practical applications such as dispersion modeling for anthropogenic and natural emission sources. However, conducting large-scale Lagrangian transport simulations with millions of air parcels or more can become rather numerically costly. In this study, we assessed the potential of exploiting graphics processing units (GPUs) to accelerate Lagrangian transport simulations. We ported the Massive-Parallel Trajectory Calculations (MPTRAC) model to GPUs using the open accelerator (OpenACC) programming model. The trajectory calculations conducted within the MPTRAC model were fully ported to GPUs, i.e., except for feeding in the meteorological input data and for extracting the particle output data, the code operates entirely on the GPU devices without frequent data transfers between CPU and GPU memory. Model verification, performance analyses, and scaling tests of the Message Passing Interface (MPI) – Open Multi-Processing (OpenMP) – OpenACC hybrid parallelization of MPTRAC were conducted on the Jülich Wizard for European Leadership Science (JUWELS) Booster supercomputer operated by the Jülich Supercomputing Centre, Germany. The JUWELS Booster comprises 3744 NVIDIA A100 Tensor Core GPUs, providing a peak performance of 71.0 PFlop s−1. As of June 2021, it is the most powerful supercomputer in Europe and listed among the most energy-efficient systems internationally. For large-scale simulations comprising 108 particles driven by the European Centre for Medium-Range Weather Forecasts' fifth-generation reanalysis (ERA5), the performance evaluation showed a maximum speed-up of a factor of 16 due to the utilization of GPUs compared to CPU-only runs on the JUWELS Booster. In the large-scale GPU run, about 67 % of the runtime is spent on the physics calculations, conducted on the GPUs. Another 15 % of the runtime is required for file I/O, mostly to read the large ERA5 data set from disk. Meteorological data preprocessing on the CPUs also requires about 15 % of the runtime. Although this study identified potential for further improvements of the GPU code, we consider the MPTRAC model ready for production runs on the JUWELS Booster in its present form. The GPU code provides a much faster time to solution than the CPU code, which is particularly relevant for near-real-time applications of a Lagrangian transport model.
Funder
National Natural Science Foundation of China
Publisher
Copernicus GmbH
Reference93 articles.
1. Abalos, M., Randel, W. J., Kinnison, D. E., and Garcia, R. R.: Using the
Artificial Tracer e90 to Examine Present and Future UTLS Tracer Transport
in WACCM, J. Atmos. Sci., 74, 3383–3403, https://doi.org/10.1175/JAS-D-17-0135.1,
2017. a, b 2. Becker, A., Wotawa, G., De Geer, L.-E., Seibert, P., Draxler, R. R., Sloan,
C., D’Amours, R., Hort, M., Glaab, H., Heinrich, P., Grillon, Y.,
Shershakov, V., Katayama, K., Zhang, Y., Stewart, P., Hirtl, M., Jean, M.,
and Chen, P.: Global backtracking of anthropogenic radionuclides by means of
a receptor oriented ensemble dispersion modelling system in support of
Nuclear-Test-Ban Treaty verification, Atmos. Environ., 41, 4520–4534,
https://doi.org/10.1016/j.atmosenv.2006.12.048, 2007. a 3. Bergman, J. W., Fierli, F., Jensen, E. J., Honomichl, S., and Pan, L. L.:
Boundary layer sources for the Asian anticyclone: Regional contributions to
a vertical conduit, J. Geophys. Res., 118, 2560–2575,
https://doi.org/10.1002/jgrd.50142, 2013. a 4. Bowman, K. P. and Carrie, G. D.: The Mean-Meridional Transport Circulation of
the Troposphere in an Idealized GCM, J. Atmos. Sci., 59, 1502–1514,
https://doi.org/10.1175/1520-0469(2002)059<1502:TMMTCO>2.0.CO;2, 2002. a 5. Bowman, K. P., Lin, J. C., Stohl, A., Draxler, R., Konopka, P., Andrews, A.,
and Brunner, D.: Input Data Requirements for Lagrangian Trajectory Models,
B. Am. Meteorol. Soc., 94, 1051–1058, https://doi.org/10.1175/BAMS-D-12-00076.1, 2013. a
Cited by
12 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|