1. J. Dongarra, S. Moore, G. Peterson, S. Tomov, J. Allred, V. Natoli, D. Richie, Exploring new architectures in accelerating CFD for Air Force applications, in: Proceedings of HPCMP Users Group Conference, 2008, pp. 14–17.
2. S. Tomov, J. Dongarra, M. Baboulin, Towards dense linear algebra for hybrid GPU accelerated manycore systems.
3. T. Halfhill, Parallel processing with CUDA, Microprocessor Journal.
4. nVidia, Compute Unified Device Architecture Programming Guide version 2.2, April 2009.
5. J. Tölke, Implementation of a lattice Boltzmann kernel using the compute unified device architecture developed by nVIDIA, Computing and Visualization in Science, 1–11.