Publisher
Springer International Publishing
Reference23 articles.
1. Antão, S.F., et al.: Offloading support for OpenMP in Clang and LLVM. In: Third Workshop on the LLVM Compiler Infrastructure in HPC, LLVM-HPC@SC 2016, Salt Lake City, UT, USA, 14 November 2016, pp. 1–11. IEEE Computer Society (2016).
https://doi.org/10.1109/LLVM-HPC.2016.006
2. Bercea, G., et al.: Implementing implicit OpenMP data sharing on GPUs. In: Proceedings of the Fourth Workshop on the LLVM Compiler Infrastructure in HPC, LLVM-HPC@SC 2017, Denver, CO, USA, 13 November 2017, pp. 5:1–5:12. ACM (2017).
https://doi.org/10.1145/3148173.3148189
3. Bertolli, C., et al.: Integrating GPU support for OpenMP offloading directives into Clang. In: Finkel, H. (ed.) Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC, LLVM 2015, Austin, Texas, USA, 15 November 2015, pp. 5:1–5:11. ACM (2015).
https://doi.org/10.1145/2833157.2833161
4. Bertolli, C., et al.: Coordinating GPU threads for OpenMP 4.0 in LLVM. In: Finkel, H., Hammond, J.R. (eds.) Proceedings of the 2014 LLVM Compiler Infrastructure in HPC, LLVM 2014, New Orleans, LA, USA, 17 November 2014, pp. 12–21. IEEE Computer Society (2014).
https://doi.org/10.1109/LLVM-HPC.2014.10
5. Bertolli, C., Bercea, G.: Performance portability with OpenMP on Nvidia GPUs. In: DOE Centers of Excellence Performance Portability Meeting (2016).
https://asc.llnl.gov/DOE-COE-Mtg-2016/talks/2-19_Bertolli.pdf
Cited by
15 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Porting HPC Applications to AMD Instinct™ MI300A using Unified Memory and OpenMP®;ISC High Performance 2024 Research Paper Proceedings (39th International Conference);2024-05
2. Specialized Kernels for Optimizing GPU Offload in OpenMP;Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis;2023-11-12
3. Implementing OpenMP’s SIMD Directive in LLVM’s GPU Runtime;Proceedings of the 52nd International Conference on Parallel Processing;2023-08-07
4. Maximizing Parallelism and GPU Utilization For Direct GPU Compilation Through Ensemble Execution;Proceedings of the 52nd International Conference on Parallel Processing Workshops;2023-08-07
5. High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs;Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming;2023-02-21