1. Frigo, M., Johnson, S.G.: The design and implementation of FFTW3. Proc. IEEE 93(2), 216–231 (2005)
2. Group Khronos: OpenCL.
https://www.khronos.org/opencl/
(2015)
3. Nath, R., Tomov, S., Dongarra, J., Agullo, E.: Autotuning dense linear algebra libraries on gpus and overview of the magma library. In: 6th International Workshop on Parallel Matrix Algorithms and Applications (PMAA’10), June (2010)
4. NVIDA: CUDA Toolkit Documentation.
http://docs.nvidia.com/cuda/index.html
, September (2015)
5. OpenACC-standard.org.: OpenACC.
http://www.openacc.org/
, March (2012)