1. [1] J. Hennessy and D. Patterson, Computer Architecture: A Quantitative Approach, 4th Edition, Morgan Kaufmann Publishers, 2007.
2. [2] T. Hoefler, et al., “A Survey of Barrier Algorithms for Coarse Grained Supercomputers,” Chemnitzer Informatik Berichte, vol. 4, no. 3, Dec. 2004.
3. [3] B. Wilkinson, Parallel Programming: Techniques and Applications Using Networked Workstations and Parallel Computers, Prentice Hall, 2004.
4. [4] O. Villa, et al., “Efficiency and Scalability of Barrier Synchronization on NoC Based Many-core Architectures,” Proc. Int'l Conf. on Compilers, Architectures and Synthesis for Embedded Systems (CASES'08), pp. 81-89, 2008.
5. Algorithms for scalable synchronization on shared-memory multiprocessors