1. Christara, C., Ding, X., Jackson, K.: An efficient transposition algorithm for distributed memory computers. In: Proceedings of the High Performance Computing Systems and Applications, pp. 349–368 (1999)
2. Midorikawa, E.T., Oliveira, H.M., Laine, J.M.: PEMPIs: A new metodology for modeling and prediction of MPI programs performance. In: Proceedings of the SBAC-PAD 2004, pp. 254–261. IEEE Computer Society Press, Los Alamitos (2004)
3. Barchet-Steffenel, L.A., Mounie, G.: Scheduling heuristics for efficient broadcast operations on grid environments. In: Proceedings of the Performance Modeling, Evaluation and Optimization of Parallel and Distributed Systems Workshop - PMEO’06 (associated to IPDPS’06), Rhodes Island, Greece, Apr. 2006, IEEE Computer Society Press, Los Alamitos (2006)
4. Kielmann, T., et al.: Network performance-aware collective communication for clustered wide area systems. Parallel Computing 27(11), 1431–1456 (2001)
5. Chun, A.T.T., Wang, C.-L.: Realistic communication model for parallel computing on cluster. In: Proceedings of the International Workshop on Cluster Computing, pp. 92–101 (1999)