1. S. Sengupta, M. Harris, Y. Zhang, J.D. Owens, Scan primitives for gpu computing, in: Proceedings of the 22nd ACM SIG- GRAPH/EUROGRAPHICS symposium on Graphics hardware, GH’07, Eurographics Association, Aire-la-Ville, Switzerland, Switzer- land, 2007, pp. 97-106. URL http://dl.acm.org/citation.cfm?id=1280094.1280110.
2. S. Collange, M. Daumas, D. Defour, Graphic processors to speed-up simulations for the design of high performance solar receptors, in: Application-specific Systems, Architectures and Processors, 2007. ASAP. IEEE International Conf. on, IEEE, 2007, pp. 377-382.
3. Z. Wei, J. JaJa, Optimization of linked list prefix computations on multithreaded gpus using cuda, in: Parallel Distributed Processing (IPDPS), 2010 IEEE International Symposium on, 2010, pp. 1-8. doi:10.1109/IPDPS. 2010.5470455.
4. K. Hawick, A. Leist, D. Playne, Parallel graph component labelling with gpus and cuda, Parallel Computing 36 (12) (2010) 655-678. doi:10.1016/j.parco.2010.07.002. URL http://www.sciencedirect.com/science/article/pii/S0167819110001055.
5. C. Leiserson, B.M. Maggs, Communication-efficient parallel algorithms for distributed random-access machines, Algorithmica 3 (1988) 53-77.