1. Neural network based silent error detector;Wang,2018
2. Analyzing a five-year failure record of a leadership-class supercomputer;Rojas,2019
3. Failures in large scale systems: long-term measurement, analysis, and implications;Gupta,2017
4. A large-scale study of failures on petascale supercomputers;Liu;J. Comput. Sci. Technol.,2018