1. Bell, N., Garland, M.: Efficient Sparse Matrix-Vector Multiplication on CUDA. NVIDIA Technical Report NVR-2008-004, NVIDIA Corporation (2008)
2. Blelloch, G.E., Heroux, M.A., Zagha, M.: Segmented Operations for Sparse Matrix Computation on Vector Multiprocessors. Tech. rep., Tech. Rep. CMU-CS-93-173, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA (1993)
3. Cevahir, A., Nukada, A., Matsuoka, S.: High performance conjugate gradient solver on multi-gpu clusters using hypergraph partitioning. Computer Science - Research and Development 25, 83–91 (2010),
http://dx.doi.org/10.1007/s00450-010-0112-6
4. Davis, T.A., Hu, Y.: The University of Florida Sparse Matrix Collection. ACM Trans. Math. Softw. (to appear)
5. Lecture Notes in Computer Science;T. Kajiyama,2006