Affiliation:
1. Barcelona Supercomputing Center, Barcelona, Spain
2. IBM T.J. Watson Research Center, Yorktown Heights, NY, USA
Abstract
Reverse-Time Migration (RTM) is a state-of-the-art technique in seismic acoustic imaging, because of the quality and integrity of the images it provides. Oil and gas companies trust RTM with crucial decisions on multi-million-dollar drilling investments. But RTM requires vastly more computational power than its predecessor techniques, and this has somewhat hindered its practical success. On the other hand, despite multi-core architectures promise to deliver unprecedented computational power, little attention has been devoted to mapping efficiently RTM to multi-cores. In this paper, we present a mapping of the RTM computational kernel to the IBM Cell/B.E. processor that reaches close-to-optimal performance. The kernel proves to be memory-bound and it achieves a 98% utilization of the peak memory bandwidth. Our Cell/B.E. implementation outperforms a traditional processor (PowerPC 970MP) in terms of performance (with an 15.0× speedup) and energy-efficiency (with a 10.0× increase in the GFlops/W delivered). Also, it is the fastest RTM implementation available to the best of our knowledge. These results increase the practical usability of RTM. Also, the RTM-Cell/B.E. combination proves to be a strong competitor in the seismic arena.
Subject
Computer Science Applications,Software
Cited by
16 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Bricks: A high-performance portability layer for computations on block-structured grids;The International Journal of High Performance Computing Applications;2024-08-19
2. BrickDL: Graph-Level Optimizations for DNNs with Fine-Grained Data Blocking on GPUs;Proceedings of the 53rd International Conference on Parallel Processing;2024-08-12
3. Efficient Large Scale Reverse-time Migration Imaging Computation based on Distributed Spark Cluster with GPUs;2023 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom);2023-12-21
4. Scalable Distributed High-Order Stencil Computations;SC22: International Conference for High Performance Computing, Networking, Storage and Analysis;2022-11
5. Exploiting reuse and vectorization in blocked stencil computations on CPUs and GPUs;Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis;2019-11-17