Author:
Rao S.,Alvisi L.,Vin H.M.
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
Subject
Computational Theory and Mathematics,Computer Science Applications,Information Systems
Cited by
25 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. SplitFT: Fault Tolerance for Disaggregated Datacenters via Remote Memory Logging;Proceedings of the Nineteenth European Conference on Computer Systems;2024-04-22
2. Middleware to Manage Fault Tolerance Using Semi-Coordinated Checkpoints;IEEE Transactions on Parallel and Distributed Systems;2021-02-01
3. Asynchronous Receiver-Driven Replay for Local Rollback of MPI Applications;2019 IEEE/ACM 9th Workshop on Fault Tolerance for HPC at eXtreme Scale (FTXS);2019-11
4. Lineage stash;Proceedings of the 27th ACM Symposium on Operating Systems Principles;2019-10-27
5. Local rollback for resilient MPI applications with application-level checkpointing and message logging;Future Generation Computer Systems;2019-02