Efficient Distributed Matrix-free Multigrid Methods on Locally Refined Meshes for FEM Computations

Author:

Munch Peter1ORCID,Heister Timo2ORCID,Prieto Saavedra Laura3ORCID,Kronbichler Martin4ORCID

Affiliation:

1. Institute of Mathematics, University of Augsburg, Germany and Institute of Material Systems Modeling, Helmholtz-Zentrum Hereon, Germany and Institute for Computational Mechanics, Technical University of Munich, München, Germany

2. Mathematical and Statistical Sciences, Clemson University, Clemson, South Carolina, USA

3. Department of Chemical Engineering, Polytechnique Montréal, Montreal, QC, Canada

4. Institute of Mathematics, University of Augsburg and Department of Information Technology, Uppsala University, Uppsala, Sweden

Abstract

This work studies three multigrid variants for matrix-free finite-element computations on locally refined meshes: geometric local smoothing, geometric global coarsening (both h -multigrid), and polynomial global coarsening (a variant of p -multigrid). We have integrated the algorithms into the same framework—the open source finite-element library deal.II —, which allows us to make fair comparisons regarding their implementation complexity, computational efficiency, and parallel scalability as well as to compare the measurements with theoretically derived performance metrics. Serial simulations and parallel weak and strong scaling on up to 147,456 CPU cores on 3,072 compute nodes are presented. The results obtained indicate that global-coarsening algorithms show a better parallel behavior for comparable smoothers due to the better load balance, particularly on the expensive fine levels. In the serial case, the costs of applying hanging-node constraints might be significant, leading to advantages of local smoothing, even though the number of solver iterations needed is slightly higher. When using p - and h -multigrid in sequence ( hp -multigrid), the results indicate that it makes sense to decrease the degree of the elements first from a performance point of view due to the cheaper transfer.

Funder

Bayerisches Kompetenznetzwerk für Technisch-Wissenschaftliches Hoch- und Höchstleistungsrechnen

Performance tuning of high-order discontinuous Galerkin solvers for SuperMUC-NG

National Science Foundation

Computational Infrastructure in Geodynamics initiative

The University of California – Davis, and by Technical Data Analysis, Inc.

Publisher

Association for Computing Machinery (ACM)

Subject

Computational Theory and Mathematics,Computer Science Applications,Hardware and Architecture,Modeling and Simulation,Software

Cited by 5 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Cache-optimized and low-overhead implementations of additive Schwarz methods for high-order FEM multigrid computations;The International Journal of High Performance Computing Applications;2023-12-06

2. Perspectives and Experiences Supporting Containers for Research Computing at the Texas Advanced Computing Center;Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis;2023-11-12

3. The deal.II Library, Version 9.5;Journal of Numerical Mathematics;2023-08-22

4. Stage-Parallel Fully Implicit Runge–Kutta Implementations with Optimal Multilevel Preconditioners at the Scaling Limit;SIAM Journal on Scientific Computing;2023-07-18

5. Efficient Application of Hanging-Node Constraints for Matrix-Free High-Order FEM Computations on CPU and GPU;Lecture Notes in Computer Science;2022

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3