1. A Recursive Algebraic Coloring Technique for Hardware-efficient Symmetric Sparse Matrix-vector Multiplication
2. J. I. Aliaga , H. Anzt , T. Grützmacher , E. S. Quintana-Ortí , and A. E. Tomás . Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units. Concurrency and Computation: Practice and Experience, 34(14) , 2022 . J. I. Aliaga, H. Anzt, T. Grützmacher, E. S. Quintana-Ortí, and A. E. Tomás. Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units. Concurrency and Computation: Practice and Experience, 34(14), 2022.
3. Ginkgo
: A Modern Linear Operator Algebra Framework for High Performance Computing
4. H. Anzt , T. Cojean , C. Yen-Chen , J. Dongarra , G. Flegar , P. Nayak , S. Tomov , Y. M. Tsai , and W. Wang . Load-balancing sparse matrix vector product kernels on gpus. ACM Transactions on Parallel Computing , 7 ( 1 ), 2020 . H. Anzt, T. Cojean, C. Yen-Chen, J. Dongarra, G. Flegar, P. Nayak, S. Tomov, Y. M. Tsai, and W. Wang. Load-balancing sparse matrix vector product kernels on gpus. ACM Transactions on Parallel Computing, 7(1), 2020.
5. On the performance and energy efficiency of sparse linear algebra on GPUs