Author:
Ammendola Roberto, Biagioni Andrea, Frezza Ottorino, Lo Cicero Francesca, Lonardo Alessandro, Paolucci Pier Stanislao, Rossetti Davide, Simula Francesco, Tosoratto Laura, Vicini Piero
Funder
EU Framework Programme 7 project EURETILE
MIUR (Italy)
Subject
Computer Networks and Communications, Hardware and Architecture, Software
Reference
19 articles.
1. Egwutuoha, A survey of fault tolerance mechanisms and checkpoint/restart implementations for high performance computing systems, J. Supercomput., 2013.
2. J.R. Stearley, et al. Increasing fault resiliency in a message-passing environment, 2009. http://dx.doi.org/10.2172/1001015, URL: http://www.osti.gov/scitech/servlets/purl/1001015.
3. Gainaru, Fault prediction under the microscope: a closer look into HPC systems, 2012.
4. P.S. Paolucci, I. Bacivarov, G. Goossens, R. Leupers, F. Rousseau, C. Schumacher, L. Thiele, P. Vicini, EURETILE 2010–2012 summary: first three years of activity of the European Reference Tiled Experiment, arXiv:1305.1459, http://dx.doi.org/10.12837/2013T01.
5. Massie, The ganglia distributed monitoring system: design, implementation, and experience, Parallel Comput., 2004.
Cited by
9 articles.