Author:
Fagg Graham E., Dongarra Jack J.
Publisher
Springer Berlin Heidelberg
References: 6 articles.
1. Beck, Dongarra, Fagg, Geist, Gray, Kohl, Migliardi, K. Moore, T. Moore, P. Papadopoulos, S. Scott, V. Sunderam, “HARNESS: a next generation distributed virtual machine”, Journal of Future Generation Computer Systems, (15), Elsevier Science B.V., 1999.
2. G. Stellner, “CoCheck: Checkpointing and Process Migration for MPI”, In Proceedings of the International Parallel Processing Symposium, pp. 526–531, Honolulu, April 1996.
3. Adnan Agbaria and Roy Friedman, “Starfish: Fault-Tolerant Dynamic MPI Programs on Clusters of Workstations”, In the 8th IEEE International Symposium on High Performance Distributed Computing, 1999.
4. Graham E. Fagg, Keith Moore, Jack J. Dongarra, “Scalable networked information processing environment (SNIPE)”, Journal of Future Generation Computer Systems, (15), pp. 571–582, Elsevier Science B.V., 1999.
5. Lect Notes Comput Sci;M. Migliardi,1999
Cited by
83 articles.
1. Rollback-Free Recovery for a High Performance Dense Linear Solver With Reduced Memory Footprint;IEEE Transactions on Parallel and Distributed Systems;2024-07
2. Extending the Legio Resilience Framework to Handle Critical Process Failures in MPI;2024 32nd Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP);2024-03-20
3. Exploit Approximation to Support Fault Resiliency in MPI-based Applications;2023 53rd Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W);2023-06
4. The Legio Fault Resilience Framework;Proceedings of the 20th ACM International Conference on Computing Frontiers;2023-05-09
5. Fault-Aware Group-Collective Communication Creation and Repair in MPI;Euro-Par 2023: Parallel Processing;2023