1. [1] E. Agullo, C. Augonnet, J. Dongarra, M. Faverge, J. Langou, H. Ltaief, and S. Tomov. LU factorization for accelerator-based systems. In 2011 9th IEEE/ACS International Conference on Computer Systems and Applications (AICCSA), pages 217-224, Dec. 2011.
2. [2] E. Agullo, C. Augonnet, J. Dongarra, M. Faverge, H. Ltaief, S. Thibault, and S. Tomov. QR factorization on a multicore node enhanced with multiple GPU accelerators. In 25th IEEE International Symposium on Parallel and Distributed Processing, IPDPS'11, pages 932-943, May 2011.
3. [3] E. Agullo, C. Augonnet, J. Dongarra, H. Ltaief, R. Namyst, S. Thibault, and S. Tomov. Faster, Cheaper, Better - a Hybridization Methodology to Develop Linear Algebra Software for GPUs. In W.-m. W. Hwu, editor, GPU Computing Gems, volume 2. Morgan Kaufmann, Sep. 2010.
4. [4] AMD. The industry-changing impact of accelerated computing. AMD Whitepaper, Advanced Micro Devices, 2008. 09/04/2011.
5. [5] E. Anderson, Z. Bai, C. Bischof, J. Demmel, J. Dongarra, J. Du Croz, A. Greenbaum, S. Hammarling, A. McKenney, S. Ostrouchov, and D. Sorensen. LAPACK Users' Guide. Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 1992.