1. Optimization of multi-level checkpoint model for large scale HPC applications;Di,2014
2. Science Prospects and Benefits with Exascale Computing;Kothe,2007
3. Modeling coordinated checkpointing for large-scale supercomputers;Wang,2005
4. Design, modeling, and evaluation of a scalable multi-level checkpointing system;Moody,2010
5. System-level fault-tolerance in large-scale parallel machines with buffered coscheduling;Petrini,2004