TweTriS: Twenty trillion-atom simulation

Author:

Tchipev Nikola1,Seckler Steffen1,Heinen Matthias2,Vrabec Jadran2,Gratl Fabio1,Horsch Martin34,Bernreuther Martin5,Glass Colin W6,Niethammer Christoph5,Hammer Nicolay7,Krischok Bernd5,Resch Michael58,Kranzlmüller Dieter7,Hasse Hans9,Bungartz Hans-Joachim17,Neumann Philipp10

Affiliation:

1. Department of Informatics, Technical University of Munich, Garching, Germany

2. Thermodynamics and Process Engineering, Technical University Berlin, Berlin, Germany

3. Engineering Department, American University of Iraq, Sulaimani (AUIS), Sulaimani, Iraq

4. Scientific Computing Department, STFC Daresbury Laboratory, Warrington, UK

5. High Performance Computing Center Stuttgart (HLRS), Stuttgart, Germany

6. Mechanical Engineering, Helmut Schmidt University, Hamburg, Germany

7. Leibniz Supercomputing Centre, Garching, Germany

8. Institute for High-Performance Computing, University of Stuttgart, Stuttgart, Germany

9. Laboratory of Engineering Thermodynamics (LTD), TU Kaiserslautern, Kaiserslautern, Germany

10. Department of Informatics, Universität Hamburg, Hamburg, Germany

Abstract

Significant improvements are presented for the molecular dynamics code ls1 mardyn — a linked cell-based code for simulating a large number of small, rigid molecules with application areas in chemical engineering. The changes consist of a redesign of the SIMD vectorization via wrappers, MPI improvements and a software redesign to allow memory-efficient execution with the production trunk to increase portability and extensibility. Two novel, memory-efficient OpenMP schemes for the linked cell-based force calculation are presented, which are able to retain Newton’s third law optimization. Comparisons to well-optimized Verlet list-based codes, such as LAMMPS and GROMACS, demonstrate the viability of the linked cell-based approach. The present version of ls1 mardyn is used to run simulations on entire supercomputers, maximizing the number of sampled atoms. Compared to the preceding version of ls1 mardyn on the entire set of 9216 nodes of SuperMUC, Phase 1, 27% more atoms are simulated. Weak scaling performance is increased by up to 40% and strong scaling performance by up to more than 220%. On Hazel Hen, strong scaling efficiency of up to 81% and 189 billion molecule updates per second is attained, when scaling from 8 to 7168 nodes. Moreover, a total of 20 trillion atoms is simulated at up to 88% weak scaling efficiency running at up to 1.33 PFLOPS. This represents a fivefold increase in terms of the number of atoms simulated to date.

Funder

Task-based load balancing and auto-tuning in particle simulations

Publisher

SAGE Publications

Subject

Hardware and Architecture,Theoretical Computer Science,Software

Cited by 47 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Using Autonomous Outlier Detection Methods for Thermophysical Property Data;Journal of Chemical & Engineering Data;2024-01-12

2. The Error-Energy Tradeoff in Molecular and Molecular-Continuum Fluid Simulations;Proceedings of the International Conference on High Performance Computing in Asia-Pacific Region Workshops;2024-01-11

3. Recent advances in Accelerated Molecular Dynamics Methods: Theory and Applications;Comprehensive Computational Chemistry;2024

4. High-Performance Computing as a Key to New Insights into Thermodynamics;High Performance Computing in Science and Engineering '22;2024

5. GPU-based molecular dynamics of fluid flows: Reaching for turbulence;The International Journal of High Performance Computing Applications;2023-12-08

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3