1. 2009. Intel Math Kernel Library. Reference Manual. Intel Corporation, Santa Clara, USA. ISBN 630813-054US.
2. 2012. AMD Core Math Library (ACML) User Guide. Advanced Micro Systems (AMD), Santa Ana, USA. https://developer.amd.com/wordpress/media/2012/10/acml_userguide.pdf\.pdf
3. 2021. AMD Optimizing CPU Libraries User Guide. Advanced Micro Systems (AMD), Santa Ana, USA. https://developer.amd.com/wp-content/resources/AOCL_User%20Guide_3.0.pdf/
4. Ahmad Abdelfattah, Stanimire Tomov, and Jack Dongarra. 2019. Fast batched matrix multiplication for small sizes using half-precision arithmetic on GPUs. In IEEE IPDPS. IEEE, 111--122.
5. Emmanuel Agullo, Cédric Augonnet, Jack Dongarra, Hatem Ltaief, Raymond Namyst, Samuel Thibault, and Stanimire Tomov. 2010. Faster Cheaper Better - a Hybridization Methodology to Develop Linear Algebra Software for GPUs.