Author:
Abdelfattah Ahmad, Haidar Azzam, Tomov Stanimire, Dongarra Jack
Reference
21 articles.
1. SuiteSparse: A suite of sparse matrix software. http://faculty.cse.tamu.edu/davis/suitesparse.html
2. LAPACK Working Note 41: Installation Guide for LAPACK, 1999. http://www.netlib.org/lapack/lawnspdf/lawn41.pdf.
3. A. Abdelfattah, A. Haidar, S. Tomov, and J. Dongarra. On the Development of Variable Size Batched Computation for Heterogeneous Parallel Architectures. In 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, IPDPS Workshops 2016, Chicago, IL, USA, May 23-27, 2016, pages 1249–1258, 2016.
4. A. Abdelfattah, A. Haidar, S. Tomov, and J. Dongarra. Performance, Design, and Autotuning of Batched GEMM for GPUs. In High Performance Computing - 31st International Conference, ISC High Performance 2016, Frankfurt, Germany, June 19-23, 2016, Proceedings, pages 21–38, 2016.
5. M. Anderson, D. Sheffield, and K. Keutzer. A Predictive Model for Solving Small Linear Algebra Problems in GPU Registers. In IEEE 26th International Parallel Distributed Processing Symposium (IPDPS), 2012.
Cited by
13 articles.
1. MAGMA: Enabling exascale performance with accelerated BLAS and LAPACK for diverse GPU architectures;The International Journal of High Performance Computing Applications;2024-06-20
2. GPU-based LU Factorization and Solve on Batches of Matrices with Band Structure;Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis;2023-11-12
3. Fast algorithm for parallel solving inversion of large scale small matrices based on GPU;The Journal of Supercomputing;2023-05-13
4. An Optimized Framework for Matrix Factorization on the New Sunway Many-core Platform;ACM Transactions on Architecture and Code Optimization;2023-03
5. Batched LU Factorization With Fast Row Interchanges for Small Matrices on GPUs;2022 IEEE 24th Int Conf on High Performance Computing & Communications; 8th Int Conf on Data Science & Systems; 20th Int Conf on Smart City; 8th Int Conf on Dependability in Sensor, Cloud & Big Data Systems & Application (HPCC/DSS/SmartCity/DependSys);2022-12