1. Adhianto, L., et al.: HPCToolkit: tools for performance analysis of optimized parallel programs. Concurr. Comput.: Pract. Exper. 22(6), 685–701 (2010)
2. Alam, S., Vetter, J.: A framework to develop symbolic performance models of parallel applications. In: Parallel and Distributed Processing Symposium, p. 368 (2006)
3. Bernard, C., Ogilvie, M.C., DeGrand, T.A., DeTar, C.E., Gottlieb, S.A., Krasnitz, A., Sugar, R., Toussaint, D.: Studying quarks and gluons on MIMD parallel computers. Int. J. High Perform. Comput. Appl. 5(4), 61–70 (1991)
4. Geimer, M., Wolf, F., Wylie, B.J.N., Ábrahám, E., Becker, D., Mohr, B.: The Scalasca performance toolset architecture. Concurr. Comput.: Pract. Exper. 22(6), 702–719 (2010)
5. Gerndt, M., Ott, M.: Automatic performance analysis with Periscope. Concurr. Comput.: Pract. Exper. 22(6), 736–748 (2010)