1. Alvisi, L., Elnozahy, E., Rao, S., Husain, S.A., Mel, A.D.: An analysis of communication induced checkpointing. In: 29th Symposium on Fault-Tolerant Computing (FTCS 1999). IEEE CS Press, Los Alamitos (1999)
2. Lecture Notes in Computer Science;G. Bosilca,2010
3. Bouteiller, A., Bosilca, G., Dongarra, J.: Redesigning the message logging model for high performance. In: ISC 2008, Wiley, Dresden (June 2008) (p. to appear)
4. Bouteiller, A., Ropars, T., Bosilca, G., Morin, C., Dongarra, J.: Reasons to be pessimist or optimist for failure recovery in high performance clusters. In: IEEE (ed.) Proceedings of the 2009 IEEE Cluster Conference (September 2009)
5. Buntinas, D., Coti, C., Herault, T., Lemarinier, P., Pilard, L., Rezmerita, A., Rodriguez, E., Cappello, F.: Blocking vs. non-blocking coordinated checkpointing for large-scale fault tolerant MPI protocols. Future Generation Computer Systems 24(1), 73–84 (2008),
http://www.sciencedirect.com/science/article/B6V06-4N2KT6H-1/2/00e790651475028977cc3031d9ea3980