The TRegion Interface and Compiler Optimizations for OpenMP Target Regions-Reference-Cited by-同舟云学术

The TRegion Interface and Compiler Optimizations for OpenMP Target Regions

Published:2019 Issue: Volume: Page:153-167
ISSN:0302-9743
Container-title:OpenMP: Conquering the Full Hardware Spectrum
language:
Short-container-title:

Author:

Doerfert Johannes^ORCID,Diaz Jose Manuel Monsalve^ORCID,Finkel Hal^ORCID

Publisher

Springer International Publishing

Link

http://link.springer.com/content/pdf/10.1007/978-3-030-28596-8_11

Reference23 articles.

1. Antão, S.F., et al.: Offloading support for OpenMP in Clang and LLVM. In: Third Workshop on the LLVM Compiler Infrastructure in HPC, LLVM-HPC@SC 2016, Salt Lake City, UT, USA, 14 November 2016, pp. 1–11. IEEE Computer Society (2016). https://doi.org/10.1109/LLVM-HPC.2016.006

2. Bercea, G., et al.: Implementing implicit OpenMP data sharing on GPUs. In: Proceedings of the Fourth Workshop on the LLVM Compiler Infrastructure in HPC, LLVM-HPC@SC 2017, Denver, CO, USA, 13 November 2017, pp. 5:1–5:12. ACM (2017). https://doi.org/10.1145/3148173.3148189

3. Bertolli, C., et al.: Integrating GPU support for OpenMP offloading directives into Clang. In: Finkel, H. (ed.) Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC, LLVM 2015, Austin, Texas, USA, 15 November 2015, pp. 5:1–5:11. ACM (2015). https://doi.org/10.1145/2833157.2833161

4. Bertolli, C., et al.: Coordinating GPU threads for OpenMP 4.0 in LLVM. In: Finkel, H., Hammond, J.R. (eds.) Proceedings of the 2014 LLVM Compiler Infrastructure in HPC, LLVM 2014, New Orleans, LA, USA, 17 November 2014, pp. 12–21. IEEE Computer Society (2014). https://doi.org/10.1109/LLVM-HPC.2014.10

5. Bertolli, C., Bercea, G.: Performance portability with OpenMP on Nvidia GPUs. In: DOE Centers of Excellence Performance Portability Meeting (2016). https://asc.llnl.gov/DOE-COE-Mtg-2016/talks/2-19_Bertolli.pdf

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Porting HPC Applications to AMD Instinct™ MI300A using Unified Memory and OpenMP®;ISC High Performance 2024 Research Paper Proceedings (39th International Conference);2024-05

2. Specialized Kernels for Optimizing GPU Offload in OpenMP;Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis;2023-11-12

3. Implementing OpenMP’s SIMD Directive in LLVM’s GPU Runtime;Proceedings of the 52nd International Conference on Parallel Processing;2023-08-07

4. Maximizing Parallelism and GPU Utilization For Direct GPU Compilation Through Ensemble Execution;Proceedings of the 52nd International Conference on Parallel Processing Workshops;2023-08-07

5. High-Performance GPU-to-CPU Transpilation and Optimization via High-Level Parallel Constructs;Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming;2023-02-21