1. Ferreira, K., Riesen, R., Oldfield, R., Stearley, J., Laros, J., Pedretti, K., Brightwell, R., Kordenbrock, T.: Increasing fault resiliency in a message-passing environment. Technical report SAND2009-6753, Sandia National Laboratories (2009)
2. Riesen, R., Ferreira, K., Stearley, J.: See applications run and throughput jump: The case for redundant computing in HPC. In: 1st International Workshop on Fault-Tolerance for HPC at Extreme Scale, FTXS 2010 (2010)
3. Network-Based Computing Laboratory, Ohio State University: OSU MPI benchmarks, OMB (2010), http://mvapich.cse.ohio-state.edu/benchmarks/
4. Schroeder, B., Gibson, G.A.: Understanding failures in petascale computers. Journal of Physics: Conference Series 78(1), 188–198 (2007)
5. Zheng, Z., Lan, Z.: Reliability-aware scalability models for high performance computing. In: Proceedings of the IEEE Conference on Cluster Computing (2009)