1. Bieker, B., Deconinck, G., Maehle, E., and Vounckx, J. (1994). Reconfiguration and Checkpointing in Massively Parallel Systems. In First European Dependable Computing Conference, EDCC-1, Lecture Notes in Computer Science 852, pages 353–370, Berlin. Springer-Verlag.
2. Bieker, B. and Maehle, E. (1998). User-Transparent Checkpointing and Restart for Parallel Computers. In Avresky, D. R. and Kaeli, D. R., editors, Fault-Tolerant Parallel and Distributed Systems, pages 385–399, Boston. Kluwer Academic Publishers.
3. Brehm, J., Worley, P. H., and Madhukar, M. (1996). Performance Modelling for SPMD Message-Passing Programs. Technical Report ORNL/TM-13254, Oak Ridge National Laboratory.
4. Elnozahy, E. N. and Zwaenepoel, W. (1994). On the Use and Implemantation of Message Logging. In Proc. 24 th Int. Fault-Tolerant Computing Symposium FTCS-24, pages 298–307.
5. Foster, I. (1995). Designing and Building Parallel Programs — Concepts and Tools for Parallel Software Engineering. Addison Wesley, New York.