Author:
Gonzalo Simon Garcia De,Huang Sitao,Gomez-Luna Juan,Hammond Simon,Mutlu Onur,Hwu Wen-mei
Cited by
20 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Specialized Kernels for Optimizing GPU Offload in OpenMP;Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis;2023-11-12
2. A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code;Proceedings of the 32nd ACM SIGPLAN International Conference on Compiler Construction;2023-02-17
3. Parallelizing Neural Network Models Effectively on GPU by Implementing Reductions Atomically;Proceedings of the International Conference on Parallel Architectures and Compilation Techniques;2022-10-08
4. MemXCT: Design, Optimization, Scaling, and Reproducibility of X-Ray Tomography Imaging;IEEE Transactions on Parallel and Distributed Systems;2022-09-01
5. A Study on Atomics-based Integer Sum Reduction in HIP on AMD GPU;Workshop Proceedings of the 51st International Conference on Parallel Processing;2022-08-29