1. A. Agbaria, R. Friedman, Starfish: Fault-tolerant dynamic MPI programs on clusters of workstations, in: 8th IEEE International Symposium on High Performance Distributed Computing, 1999
2. Banerjee, Bounds on algorithm-based fault tolerance in multiple processor systems, IEEE Transactions on Computers, 1986
3. Banerjee, Algorithm-based fault-tolerance on a hypercube multiprocessor, IEEE Transactions on Computers, 1990
4. A. Bouteiller, G. Bosilca, J. Dongarra, Redesigning the message logging model for high performance, in: ISC 2008, International Supercomputing Conference, Dresden, Germany, June 17–20, 2008
5. A. Bouteiller, P. Lemarinier, G. Krawezik, F. Cappello, Coordinated checkpoint versus message log for fault tolerant MPI, in: Proceedings of Cluster 2003, Hong Kong, December 2003