1. Ahn JH, Erez M, Dally WJ (2005) Scatter-Add in data parallel architectures. In: HPCA’05: international symposium on high-performance computer architecture, pp 132–142
2. Arevalo A, Matinata RM, Pandian M, Peri E, Ruby K, Thomas F, Almond C. Programming the cell broadband engine architecture: examples and best practices. http://www.redbooks.ibm.com/redbooks/pdfs/sg247575.pdf
3. Asanovic K, Bodik R, Catanzaro BC, Gebis JJ, Husbands P, Keutzer K, Patterson DA, Plishker WL, Shalf J, Williams SW, Yelick KA (2006) The landscape of parallel computing research: a view from Berkeley. Tech Rep UCB/EECS-2006-183, EECS Department, University of California, Berkeley
4. Balart J, Duran A, Gonzalez M, Martorell X, Ayguade E, Labarta J (2004) Nanos Mercurium: a research compiler for OpenMP. In: EWOMP’04: European workshop on OpenMP, pp 103–109
5. Bellens P, Perez JM, Badia RM, Labarta J (2006) CellSs: a programming model for the Cell BE architecture. In: SC’06: ACM/IEEE conference on supercomputing, p 86