1. On the implementation of fast marching methods for 3D lattices, Tech. Rep., Informatics and Mathematical Modelling;Bærentzen,2001
2. Efficient sparse matrix–vector multiplication on cuda, Tech. Rep. NVR-2008-004;Bell,2008
3. N. Bell, M. Garland, Implementing sparse matrix–vector multiplication on throughput-oriented processors, in: Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, 2009, pp. 18:1–18:11.
4. A comprehensive comparison of GPU-and FPGA-based acceleration of reflection image reconstruction for 3D ultrasound computer tomography;Birk;J. Real-Time Image Process.,2012
5. GPU accelerated greedy algorithms for compressed sensing;Blanchard;Mathematical Programming Computation,2013