CLBlast-Reference-Cited by-同舟云学术

CLBlast

Published:2018-05-14 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the International Workshop on OpenCL
language:
Short-container-title:

Author:

Nugteren Cedric¹

Affiliation:

1. TomTom, Amsterdam, The Netherlands

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3204919.3204924

Reference20 articles.

1. Performance, Design, and Autotuning of Batched GEMM for GPUs

2. R. Ballester-Ripoll E. G. Paredes and R. Pajarola. 2017. Sobol Tensor Trains for Global Sensitivity Analysis. ArXiv e-prints (Dec. 2017). arXiv:1712.00233 R. Ballester-Ripoll E. G. Paredes and R. Pajarola. 2017. Sobol Tensor Trains for Global Sensitivity Analysis. ArXiv e-prints (Dec. 2017). arXiv:1712.00233

3. The future of microprocessors

4. Sharan Chetlur Cliff Woolley Philippe Vandermersch Jonathan Cohen John Tran Bryan Catanzaro and Evan Shelhamer. 2014. cuDNN: Efficient Primitives for Deep Learning. (2014). Sharan Chetlur Cliff Woolley Philippe Vandermersch Jonathan Cohen John Tran Bryan Catanzaro and Evan Shelhamer. 2014. cuDNN: Efficient Primitives for Deep Learning. (2014).

5. Machine Learning Based Auto-Tuning for Enhanced OpenCL Performance Portability

Cited by 45 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. COALA: A Compiler-Assisted Adaptive Library Routines Allocation Framework for Heterogeneous Systems;IEEE Transactions on Computers;2024-07

2. Towards Dynamic Autotuning of SpMV in CUSP Library;2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW);2024-05-27

3. Mobiprox: Supporting Dynamic Approximate Computing on Mobiles;IEEE Internet of Things Journal;2024-05-01

4. FlexGEMM: A Flexible Micro-kernel Generation Framework;Proceedings of the 5th International Conference on Computer Information and Big Data Applications;2024-04-26

5. Opencl-pytorch: an OpenCL-based extension of PyTorch;CCF Transactions on High Performance Computing;2024-04-08