Author:
Larson J.W.,Hegland M.,Harding B.,Roberts S.,Stals L.,Rendell A.P.,Strazdins P.,Ali M.M.,Kowitz C.,Nobes R.,Southern J.,Wilson N.,Li M.,Oishi Y.
Reference38 articles.
1. F. Cappello, Fault tolerance in petascale/exascale systems: Current knowledge, challenges and research opportunities, International Jour- nal of High Performance Computing Applications 23 (3) (2009) 212-226. arXiv:http://hpc.sagepub.com/content/23/3/212.full.pdf+html, doi:10.1177/1094342009106189.
2. W. Gropp, E. Lusk, Fault tolerance in MPI programs, Special issue of the Journal High Performance Computing Applications (IJHPCA) 18 (2002) 363-372.
3. K.-H. Huang, J. A. Abraham, Algorithm-based fault tolerance for matrix operations, IEEE Trans. Comput. 33 (6) (1984) 518-528. doi:10.1109/TC.1984.1676475.
4. G. Bosilca, R. Delmas, J. Dongarra, J. Langou, Algorithm-based fault tolerance applied to high performance computing, J. Parallel Distrib. Comput. 69 (4) (2009) 410-416. doi:10.1016/j.jpdc.2008.12.002.
5. J. Dean, S. Ghemawat, MapReduce: simplified data processing on large clusters, in: OSDI’04: Proceedings of the 6th conference on Symposium on Operating Systems Design & Implementation, USENIX Association, Berkeley, CA, USA, 2004, pp. 10-10.
Cited by
15 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献