1. Near-optimal loop tiling by means of cache miss equations and genetic algorithms;Abella,2002
2. LAPACK Users’ Guide;Anderson,1999
3. R. Berrendorf, B. Mohr, PCL—The Performance Counter Library: A Common Interface to Access Hardware Performance Counters on Microprocessors (Version 2.2), Research Centre Jülich, January 2003.
4. Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology;Bilmes,1997
5. R.W. Brankin, I. Gladwell, L.F. Shampine, RKSUITE release 1.0 1991.