The ESCAPE project: Energy-efficient Scalable Algorithms for Weather Prediction at Exascale
-
Published:2019-10-22
Issue:10
Volume:12
Page:4425-4441
-
ISSN:1991-9603
-
Container-title:Geoscientific Model Development
-
language:en
-
Short-container-title:Geosci. Model Dev.
Author:
Müller Andreas, Deconinck Willem, Kühnlein ChristianORCID, Mengaldo Gianmarco, Lange Michael, Wedi Nils, Bauer PeterORCID, Smolarkiewicz Piotr K., Diamantakis Michail, Lock Sarah-JaneORCID, Hamrud Mats, Saarinen Sami, Mozdzynski George, Thiemert Daniel, Glinton Michael, Bénard Pierre, Voitus Fabrice, Colavolpe Charles, Marguinaud Philippe, Zheng Yongjun, Van Bever Joris, Degrauwe Daan, Smet Geert, Termonia Piet, Nielsen Kristian P., Sass Bent H., Poulsen Jacob W., Berg Per, Osuna Carlos, Fuhrer OliverORCID, Clement Valentin, Baldauf MichaelORCID, Gillard Mike, Szmelter JoannaORCID, O'Brien Enda, McKinstry Alastair, Robinson Oisín, Shukla Parijat, Lysaght Michael, Kulczewski Michał, Ciznicki Milosz, Piątek Wojciech, Ciesielski Sebastian, Błażewicz Marek, Kurowski Krzysztof, Procyk Marcin, Spychala Pawel, Bosak Bartosz, Piotrowski Zbigniew P.ORCID, Wyszogrodzki AndrzejORCID, Raffin ErwanORCID, Mazauric Cyril, Guibert DavidORCID, Douriez Louis, Vigouroux XavierORCID, Gray Alan, Messmer Peter, Macfaden Alexander J., New Nick
Abstract
Abstract. In the simulation of complex multi-scale flows arising in weather and climate modelling, one of the biggest challenges is to satisfy strict service requirements in terms of time to solution and to satisfy budgetary constraints in terms of energy to solution, without compromising the accuracy and stability of the application. These simulations require algorithms that minimise the energy footprint along with the time required to produce a solution, maintain the physically required level of accuracy, are numerically stable, and are resilient in case of hardware failure. The European Centre for Medium-Range Weather Forecasts (ECMWF) led the ESCAPE (Energy-efficient Scalable Algorithms for Weather Prediction at Exascale) project, funded by Horizon 2020 (H2020) under the FET-HPC (Future and Emerging Technologies in High Performance Computing) initiative. The goal of ESCAPE was to develop a sustainable strategy to evolve weather and climate prediction models to next-generation computing technologies. The project partners incorporate the expertise of leading European regional forecasting consortia, university research, experienced high-performance computing centres, and hardware vendors. This paper presents an overview of the ESCAPE strategy: (i) identify domain-specific key algorithmic motifs in weather prediction and climate models (which we term Weather & Climate Dwarfs), (ii) categorise them in terms of computational and communication patterns while (iii) adapting them to different hardware architectures with alternative programming models, (iv) analyse the challenges in optimising, and (v) find alternative algorithms for the same scheme. The participating weather prediction models are the following: IFS (Integrated Forecasting System); ALARO, a combination of AROME (Application de la Recherche à l'Opérationnel à Meso-Echelle) and ALADIN (Aire Limitée Adaptation Dynamique Développement International); and COSMO–EULAG, a combination of COSMO (Consortium for Small-scale Modeling) and EULAG (Eulerian and semi-Lagrangian fluid solver). For many of the weather and climate dwarfs ESCAPE provides prototype implementations on different hardware architectures (mainly Intel Skylake CPUs, NVIDIA GPUs, Intel Xeon Phi, Optalysys optical processor) with different programming models. The spectral transform dwarf represents a detailed example of the co-design cycle of an ESCAPE dwarf. The dwarf concept has proven to be extremely useful for the rapid prototyping of alternative algorithms and their interaction with hardware; e.g. the use of a domain-specific language (DSL). Manual adaptations have led to substantial accelerations of key algorithms in numerical weather prediction (NWP) but are not a general recipe for the performance portability of complex NWP models. Existing DSLs are found to require further evolution but are promising tools for achieving the latter. Measurements of energy and time to solution suggest that a future focus needs to be on exploiting the simultaneous use of all available resources in hybrid CPU–GPU arrangements.
Funder
H2020 Future and Emerging Technologies
Publisher
Copernicus GmbH
Reference52 articles.
1. Asanović, K., Bodik, R., Catanzaro, B. C., Gebis, J. J., Husbands, P.,
Keutzer, K., Patterson, D. A., Plishker, W. L., Shalf, J., Williams, S. W.,
and Yelick, K. A.: The landscape of parallel computing research: a view from
Berkeley, Tech. Rep. UCB/EECS-2006-183, EECS Department, University of
California, Berkeley,
available at: http://www2.eecs.berkeley.edu/Pubs/TechRpts/2006/EECS-2006-183.pdf (last access: 27 September 2019),
2006. a 2. Asanović, K., Wawrzynek, J., Wessel, D., Yelick, K., Bodik, R., Demmel, J.,
Keaveny, T., Keutzer, K., Kubiatowicz, J., Morgan, N., Patterson, D., and
Sen, K.: A view of the parallel computing landscape, Comm. ACM, 52, 56–67,
https://doi.org/10.1145/1562764.1562783, 2009. a 3. Bauer, P., Thorpe, A., and Brunet, G.: The quiet revolution of numerical
weather prediction, Nature, 525, 47–55, https://doi.org/10.1038/nature14956, 2015. a 4. Bénard, P. and Glinton, M.: Circumventing the pole problem of reduced
lat-lon grids with local schemes. Part I: analysis and model formulation,
Q. J. Roy, Meteor. Soc., 145, 1377–1391, https://doi.org/10.1002/qj.3509, 2019. a 5. Clement, V., Ferrachat, S., Fuhrer, O., Lapillonne, X., Osuna, C. E., Pincus,
R., Rood, J., and Sawyer, W.: The CLAW DSL, in: Proceedings of the
Platform for Advanced Scientific Computing Conference – PASC'18, ACM Press, https://doi.org/10.1145/3218176.3218226, 2018. a
Cited by
17 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|