MPIWiz-Reference-Cited by-同舟云学术

MPIWiz

Published:2009-02-14 Issue:4 Volume:44 Page:251-260
ISSN:0362-1340
Container-title:ACM SIGPLAN Notices
language:en
Short-container-title:SIGPLAN Not.

Author:

Xue Ruini¹,Liu Xuezheng²,Wu Ming²,Guo Zhenyu²,Chen Wenguang¹,Zheng Weimin¹,Zhang Zheng²,Voelker Geoffrey³

Affiliation:

1. Tsinghua University, Beijing, China

2. Microsoft Research Asia, Beijing, China

3. University of California at San Diego, San Diego, California, USA

Abstract

Message Passing Interface (MPI) is a widely used standard for managing coarse-grained concurrency on distributed computers. Debugging parallel MPI applications, however, has always been a particularly challenging task due to their high degree of concurrent execution and non-deterministic behavior. Deterministic replay is a potentially powerful technique for addressing these challenges, with existing MPI replay tools adopting either data-replay or order-replay approaches. Unfortunately, each approach has its tradeoffs. Data-replay generates substantial log sizes by recording every communication message. Order-replay generates small logs, but requires all processes to be replayed together. We believe that these drawbacks are the primary reasons that inhibit the wide adoption of deterministic replay as the critical enabler of cyclic debugging of MPI applications. This paper describes subgroup reproducible replay (SRR), a hybrid deterministic replay method that provides the benefits of both data-replay and order-replay while balancing their trade-offs. SRR divides all processes into disjoint groups. It records the contents of messages crossing group boundaries as in data-replay, but records just message orderings for communication within a group as in order-replay. In this way, SRR can exploit the communication locality of traffic patterns in MPI applications. During replay, developers can then replay each group individually. SRR reduces recording overhead by not recording intra-group communication, and reduces replay overhead by limiting the size of each replay group. Exposing these tradeoffs gives the user the necessary control for making deterministic replay practical for MPI applications. We have implemented a prototype, MPIWiz, to demonstrate and evaluate SRR. MPIWiz employs a replay framework that allows transparent binary instrumentation of both library and system calls. As a result, MPIWiz replays MPI applications with no source code modification and relinking, and handles non-determinism in both MPI and OS system calls. Our preliminary results show that MPIWiz can reduce recording overhead by over a factor of four relative to data-replay, yet without requiring the entire application to be replayed as in order-replay. Recording increases execution time by 27% while the application can be replayed in just 53% of its base execution time.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Graphics and Computer-Aided Design,Software

Link

https://dl.acm.org/doi/pdf/10.1145/1594835.1504213

Reference38 articles.

1. An improved two-way partitioning algorithm with stable performance (VLSI)

2. Automated, scalable debugging of MPI programs with Intel® Message Checker

Cited by 40 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Reproducibility, Replicability and Repeatability: A survey of reproducible research with a focus on high performance computing;Computer Science Review;2024-08

2. Efficient Deadlock Detection in MPI Programs with Path Compression and Focus Matching;Proceedings of the 15th Asia-Pacific Symposium on Internetware;2024-07-24

3. Program partitioning and deadlock analysis for MPI based on logical clocks;Parallel Computing;2024-02

4. A Survey of Graph Comparison Methods with Applications to Nondeterminism in High-Performance Computing;The International Journal of High Performance Computing Applications;2023-04-05

5. Improving the Efficiency of Deadlock Detection in MPI Programs Through Trace Compression;IEEE Transactions on Parallel and Distributed Systems;2023-01-01