1. YELICK K. Ten ways to waste a parallel computer [C]// Proceedings of the 36th Annual International Symposium on Computer Architecture. Austin, TX, USA: ACM, 2009: 1.
2. LI D, de SUPINSKI B, SCHULZ M, CAMERON K, NIKOLOPOULOS D S. Hybrid MPI/OpenMP power-aware computing [C]// Proceedings of 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS). Atlanta, GA: IEEE Press, 2010: 1–12.
3. PJESIVAC-GRBOVIC J, ANGSKUN T, BOSILCA G, FAGG G E, GABRIEL E, DONGARRA J J. Performance analysis of MPI collective operations [C]// Cluster Computing-07. Hingham, MA, USA: Kluwer Academic Publishers, 2007: 127–143.
4. YEW P C, TZENG N F, LAWRIE D H. Distributing hot-spot addressing in large scale multiprocessors [J]. IEEE Transactions on Computers, 1987, C-36(4): 388–395.
5. HENSGEN D, FINKEL R, MANBER U. Two algorithms for barrier synchronization [J]. Int J Parallel Program, 1988, 17(1): 1–17.