Author:
Sørensen Hans Henrik Brandenborg
Publisher
Springer Berlin Heidelberg
Reference
15 articles.
1. NVIDIA Corp.: CUDA Toolkit Version 3.2. (2010)
2. Khronos Group: OpenCL Specification 1.1. (2010)
3. Dongarra, J.J., Du Croz, J., Hammarling, S., Duff, I.S.: A set of level 3 basic linear algebra subprograms. ACM Trans. Math. Softw. 16, 1–17 (1990)
4. Anderson, E., Bai, Z., Bischof, C., Blackford, L.S., Demmel, J., Dongarra, J.J., Du Croz, J., Hammarling, S., Greenbaum, A., McKenney, A., Sorensen, D.: LAPACK Users’ guide, 3rd edn. SIAM, Philadelphia (1999)
5. Tomov, S., Nath, R., Du, P., Dongarra, J.: MAGMA v0.2 Users’ Guide (2009)
Cited by
8 articles.
1. Block-Size Independence for GPU Programs;Static Analysis;2018
2. MATOG;ACM Transactions on Architecture and Code Optimization;2017-09-06
3. An Efficient GPU Implementation of Bulk Computation of the Eigenvalue Problem for Many Small Real Non-symmetric Matrices;International Journal of Networking and Computing;2017
4. Automatic Thread-Block Size Adjustment for Memory-Bound BLAS Kernels on GPUs;2016 IEEE 10th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSOC);2016-09
5. Adaptive GPU Array Layout Auto-Tuning;Proceedings of the ACM Workshop on Software Engineering Methods for Parallel and High Performance Applications;2016-05-31