Enhancing computational efficiency in 3-D seismic modelling with half-precision floating-point numbers based on the curvilinear grid finite-difference method

Author:

Wan Jialiang1ORCID,Wang Wenqiang2ORCID,Zhang Zhenguo13ORCID

Affiliation:

1. Department of Earth and Space Sciences, Southern University of Science and Technology , Shenzhen 518055 , China

2. High Performance Computing Department, National Supercomputing Center in Shenzhen , Shenzhen 518055 , China

3. Guangdong Provincial Key Laboratory of Geophysical High-resolution Imaging Technology, Southern University of Science and Technology , Shenzhen 518055 , China

Abstract

SUMMARY Large-scale and high-resolution seismic modelling are very significant to simulating seismic waves, evaluating earthquake hazards and advancing exploration seismology. However, achieving high-resolution seismic modelling requires substantial computing and storage resources, resulting in a considerable computational cost. To enhance computational efficiency and performance, recent heterogeneous computing platforms, such as Nvidia Graphics Processing Units (GPUs), natively support half-precision floating-point numbers (FP16). FP16 operations can provide faster calculation speed, lower storage requirements and greater performance enhancement over single-precision floating-point numbers (FP32), thus providing significant benefits for seismic modelling. Nevertheless, the inherent limitation of fewer 16-bit representations in FP16 may lead to severe numerical overflow, underflow and floating-point errors during computation. In this study, to ensure stable wave equation solutions and minimize the floating-point errors, we use a scaling strategy to adjust the computation of FP16 arithmetic operations. For optimal GPU floating-point performance, we implement a 2-way single instruction multiple data (SIMD) within the floating-point units (FPUs) of CUDA cores. Moreover, we implement an earthquake simulation solver for FP16 operations based on curvilinear grid finite-difference method (CGFDM) and perform several earthquake simulations. Comparing the results of wavefield data with the standard CGFDM using FP32, the errors introduced by FP16 are minimal, demonstrating excellent consistency with the FP32 results. Performance analysis indicates that FP16 seismic modelling exhibits a remarkable improvement in computational efficiency, achieving a speedup of approximately 1.75 and reducing memory usage by half compared to the FP32 version.

Funder

National Natural Science Foundation of China

Shenzhen Science and Technology Innovation Program

Southern University of Science and Technology

Publisher

Oxford University Press (OUP)

Reference43 articles.

1. Accelerating scientific computations with mixed precision algorithms;Baboulin;Comput. Phys. Commun.,2009

2. High-precision floating-point arithmetic in scientific computation;Bailey;Comput. Sci. Eng.,2005

3. EDGE: Extreme scale fused seismic simulations with the discontinuous Galerkin method;Breuer,2017

4. A systematic and efficient method of computing normal modes for multilayered half-space;Chen;Geophys. J. Int.,1993

5. Seismogram synthesis in multi-layered half-space part I. theoretical formulations;Chen;Earthq. Res. China,1999

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3