1. [n.d.]. Arm Performance Libraries. https://developer.arm.com/tools-and-software/server-and-hpc/compile/arm-compiler-for-linux/arm-performance-libraries
2. [n.d.]. Intel oneAPI Math Kernel Library. https://www.intel.com/content/www/us/en/developer/tools/oneapi/onemkl.html
3. [n.d.]. OpenBLAS: An optimized BLAS library. http://www.openblas.net/
4. Performance, Design, and Autotuning of Batched GEMM for GPUs
5. Demystifying Parallel and Distributed Deep Learning