FAMOUS, faster: using parallel computing techniques to accelerate the FAMOUS/HadCM3 climate model with a focus on the radiative transfer algorithm
Published: 2011-09-27
Issue: 3
Volume: 4
Page: 835-844
ISSN: 1991-9603
Container-title: Geoscientific Model Development
Language: en
Short-container-title: Geosci. Model Dev.
Author:
Hanappe P., Beurivé A., Laguzet F., Steels L., Bellouin N., Boucher O., Yamazaki Y. H., Aina T., Allen M.
Abstract
We have optimised the atmospheric radiation algorithm of the FAMOUS climate model on several hardware platforms. The optimisation involved translating the Fortran code to C and restructuring the algorithm around the computation of a single air column. Instead of the existing MPI-based domain decomposition, we used a task queue and a thread pool to schedule the computation of individual columns on the available processors. Finally, four air columns are packed together in a single data structure and computed simultaneously using Single Instruction Multiple Data (SIMD) operations. The modified algorithm runs more than 50 times faster on the CELL's Synergistic Processing Element than on its main PowerPC processing element. On Intel-compatible processors, the new radiation code runs 4 times faster. On the tested graphics processor, using OpenCL, we find a speed-up of more than 2.5 times compared with the original code on the main CPU. Because the radiation code takes more than 60 % of the total CPU time, FAMOUS executes more than twice as fast. Our version of the algorithm returns bit-wise identical results, which demonstrates the robustness of our approach. We estimate that this project required around two and a half man-years of work.
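The scheduling idea described in the abstract (a task queue feeding a pool of worker threads, one air column per task) can be illustrated with a minimal sketch. This is not the authors' C implementation: the function names `compute_column` and `run_columns`, the placeholder arithmetic, and the worker count are all hypothetical, and Python is used only for brevity.

```python
import queue
import threading


def compute_column(column):
    # Hypothetical stand-in for the per-column radiation computation;
    # here it simply averages the values in the column.
    return sum(column) / len(column)


def run_columns(columns, num_workers=4):
    # Fill a task queue with one task per air column.
    tasks = queue.Queue()
    for index, column in enumerate(columns):
        tasks.put((index, column))

    # Each task writes to its own slot, so no lock is needed on results.
    results = [None] * len(columns)

    def worker():
        # Each worker pulls columns until the queue is empty.
        while True:
            try:
                index, column = tasks.get_nowait()
            except queue.Empty:
                return
            results[index] = compute_column(column)

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return results


# Sixteen illustrative columns of eight levels each (made-up data).
columns = [[float(i + k) for k in range(8)] for i in range(16)]
results = run_columns(columns)
```

Because every column is independent, workers never wait on one another and idle threads simply take the next column from the queue. In CPython the global interpreter lock prevents pure-Python threads from running this arithmetic in parallel, so the sketch shows the scheduling pattern only, not the speed-ups reported above.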
Publisher
Copernicus GmbH
Cited by
13 articles.