1. Blelloch, Vector Models for Data-Parallel Computing, 1990.
2. Harris, Parallel prefix sum (scan) with CUDA, 2007.
3. Sengupta, Scan primitives for GPU computing, 2007.
4. Dotsenko, Fast scan algorithms on graphics processors, 2008.
5. S. Sengupta, M. Harris, M. Garland, Efficient parallel scan algorithms for GPUs, Technical Report NVR-2008-003, NVIDIA Corporation, 2008.