Funder
Netherlands eScience Center
Subject
Computer Networks and Communications, Hardware and Architecture, Software
References
47 articles.
1. Program optimization space pruning for a multithreaded GPU; Ryoo, 2008
2. C. Nugteren, V. Codreanu, CLTune: A generic auto-tuner for OpenCL kernels, in: 2015 IEEE 9th International Symposium on Embedded Multicore/Many-core Systems-on-Chip, MCSoC, 2015, pp. 195–202.
3. Maestro: Data orchestration and tuning for OpenCL devices; Spafford, 2010
4. Autotuning GPU Kernels via Static and Predictive Analysis; Lim, 2017
5. Auto-Tuning dedispersion for many-core accelerators; Sclocco, 2014
Cited by
44 articles.
1. Automated Backend Allocation for Multi-Model, On-Device AI Inference; Proceedings of the ACM on Measurement and Analysis of Computing Systems; 2023-12-07
2. Scalable Tuning of (OpenMP) GPU Applications via Kernel Record and Replay; Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis; 2023-11-11
3. Performance Tuning for GPU-Embedded Systems: Machine-Learning-Based and Analytical Model-Driven Tuning Methodologies; 2023 IEEE 35th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD); 2023-10-17
4. Improve the Performance of Parallel Reduction on General-Purpose Graphics Processor Units Using Prediction Models; Proceedings of the International Conference on Research in Adaptive and Convergent Systems; 2023-08-06
5. Revisiting Temporal Blocking Stencil Optimizations; Proceedings of the 37th International Conference on Supercomputing; 2023-06-21