1. Understanding failures in petascale computers;Schroeder;Journal of Physics: Conference Series,2007
2. Toward exascale resilience;Cappello;International Journal of High Performance Computing Applications,2009
3. Message Passing Interface Forum, MPI: A Message Passing Interface, in: Proceedings of Supercomputing ’93, IEEE Computer Society Press, 1993, pp. 878–883.
4. E. Gabriel, G.E. Fagg, G. Bosilca, T. Angskun, J.J. Dongarra, J.M. Squyres, V. Sahay, P. Kambadur, B. Barrett, A. Lumsdaine, R.H. Castain, D.J. Daniel, R.L. Graham, T.S. Woodall, Open MPI: Goals, concept, and design of a next generation MPI implementation, in: Proceedings of the 11th European PVM/MPI Users’ Group Meeting, Budapest, Hungary, 2004, pp. 97–104.
5. Fault Tolerance Working Group, Run-though stabilization interfaces and semantics, svn.mpi-forum.org/trac/mpi-forum-web/wiki/ft/run_through_stabilization, July 2011.