1. N. Bell, M. Garland, Implementing Sparse Matrix–Vector Multiplication on throughput-oriented processors, in: SC, ACM, 2009, pp. 1–11.
2. N. Bell, M. Garland, Cusp: Generic Parallel Algorithms for Sparse Matrix and Graph Computations (2012). , version 0.3.0.
3. Model-driven autotuning of sparse matrix–vector multiply on GPUs;Choi;ACM SIGPLAN Not.,2010
4. Automatically tuning sparse matrix-vector multiplication for GPU architectures;Monakov,2010
5. M.M. Baskaran, R. Bordawekar, Optimizing sparse matrix–vector multiplication on GPUs, Tech. Rep., IBM TJ Watson Research Center, 2008.