Author:
Hassani Amin,Skjellum Anthony,Bangalore Purushotham V.,Brightwell Ron
Funder
National Science Foundation
Sandia National Laboratories
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. A Fault-Model-Relevant Classification of Consensus Mechanisms for MPI and HPC;International Journal of Parallel Programming;2022-12-12
2. Compiler aided checkpointing using crash-consistent data structures in NVMM systems;Proceedings of the 34th ACM International Conference on Supercomputing;2020-06-29
3. Fault tolerance of MPI applications in exascale systems: The ULFM solution;Future Generation Computer Systems;2020-05
4. Challenges in Developing MPI Fault-Tolerant Fortran Applications;2018 IEEE International Conference on Cluster Computing (CLUSTER);2018-09
5. Towards a More Complete Understanding of SDC Propagation;Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing;2017-06-26