Author:
Fang Aiman,Cavelan Aurelien,Robert Yves,Chien Andrew A.
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Improving Scalability of Silent-Error Resilience for Message-Passing Solvers via Local Recovery and Asynchrony;2020 IEEE/ACM 10th Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS);2020-11
2. FPD
etect;ACM Transactions on Architecture and Code Optimization;2020-09-30
3. Mimic: Fast Recovery from Data Corruption Errors in Stencil Computations;2019 IEEE 38th International Performance Computing and Communications Conference (IPCCC);2019-10
4. Application health monitoring for extreme‐scale resiliency using cooperative fault management;Concurrency and Computation: Practice and Experience;2019-07-25
5. Node failure resiliency for Uintah without checkpointing;Concurrency and Computation: Practice and Experience;2019-06-02