1. Cico, L., J. Greene, and R. Cooper. 2005. Performance estimates of a STAP benchmark on the IBM Cell processor. Proceedings of the Ninth Annual High Performance Embedded Computing Workshop. Available online at http://www.ll.mit.edu/HPEC/agendas/proc05/agenda.html.
2. Cummings, J., J. Crotinger, S. Haney, W. Humphrey, S. Karmesin, J. Reynders, S. Smith, and T. Williams. 1998. Rapid application development and enhanced code portability using the POOMA framework. Presented at theSIAM Workshop on Object-Oriented Methods for Interoperable Scienti~c and Engineering Computing. Yorktown Heights, NY.
3. Self-Adapting Linear Algebra Algorithms and Software
4. Franchetti, F. and M. Püschel. 2003. Short vector code generation for the discrete Fourier transform. Proceedings of the 17th International Parallel and Distributed Processing Symposium 58-67.
5. The Design and Implementation of FFTW3