1. Cross component optimisation in a high level category-based language;Ashby,2004
2. Compiler transformations for high-performance computing;Bacon;ACM Computing Surveys,1994
3. Runtime code generation in C++ as a foundation for domain-specific optimisation;Beckmann,2003
4. Efficient interprocedural data placement optimisation in a parallel library;Beckmann,1998
5. An updated set of basic linear algebra subprograms (BLAS);Blackford;ACM Trans. Math. Softw.,2002