1. Checkpoint/Restart for CUDA Kernels;Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis;2023-11-12
2. Fault Tolerant High Performance Solver for Linear Equation Systems;2019 38th Symposium on Reliable Distributed Systems (SRDS);2019-10
3. Wireless Grids;Advances in Wireless Technologies and Telecommunication;2016
4. A Method of Self-Adaptive Pre-Copy Container Checkpoint;2015 IEEE 21st Pacific Rim International Symposium on Dependable Computing (PRDC);2015-11
5. Fault-Tolerant MPI;Computer Communications and Networks;2015