1. D. Bertsekas and J. Tsitsiklis, Parallel and Distributed Computation: Numerical Methods, Prentice Hall, 1989.
2. J. Choi, J. Dongarra and D. Walker, Parallel Matrix Transpose Algorithms on Distributed Memory Concurrent Computers, Parallel Computing, vol. 21, 1995, pp.1387–1405.
3. X. Ding, Numerical Solution of the Shallow-Water Equations on Distributed Memory Systems, M.Sc. Thesis, Computer Science Department, University of Toronto, 1998. Available from
http://www.cs.toronto.edu/NA/reports.html
.
4. I. Foster, Designing and Building Parallel Programs: Concepts and Tools for Parallel Software Engineering, Addison-Wesley, 1995.
5. W. Gropp, Using MPI: portable parallel programming with the message-passing interface, MIT Press, 1994.