Author:
Maroñas Marcos,Mateo Sergi,Keller Kai,Bautista-Gomez Leonardo,Ayguadé Eduard,Beltran Vicenç
Funder
Spanish Ministerio de Ciencia, Innovación y Universidades
Generalitat de Catalunya
European Union’s Seventh Framework Programme
Horizon 2020
Subject
Computer Networks and Communications,Hardware and Architecture,Software
Reference45 articles.
1. High-End Computing: The Challenge of Scale;Reed,2004
2. Toward exascale resilience: 2014 update;Cappello;Supercomput. Front. Innov.,2014
3. Detecting and correcting data corruption in stencil applications through multivariate interpolation;Bautista-Gomez,2015
4. Algorithm-based fault tolerance for dense matrix factorizations;Du;ACM SIGPLAN Not.,2012
5. Berkeley lab checkpoint/restart (BLCR) for Linux clusters;Hargrove;J. Phys. Conf. Ser.,2006
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Task-Level Resilience: Checkpointing vs. Supervision;International Journal of Networking and Computing;2022
2. Assessing the Use Cases of Persistent Memory in High-Performance Scientific Computing;2021 IEEE/ACM 11th Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS);2021-11
3. Checkpointing vs. Supervision Resilience Approaches for Dynamic Independent Tasks;2021 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW);2021-06