Hands on with OpenMP4.5 and Unified Memory: Developing Applications for IBM’s Hybrid CPU + GPU Systems (Part II)-Reference-Cited by-同舟云学术

Hands on with OpenMP4.5 and Unified Memory: Developing Applications for IBM’s Hybrid CPU + GPU Systems (Part II)

Published:2017 Issue: Volume: Page:17-29
ISSN:0302-9743
Container-title:Scaling OpenMP for Exascale Performance and Portability
language:
Short-container-title:

Author:

Grinberg Leopold,Bertolli Carlo,Haque Riyaz

Publisher

Springer International Publishing

Link

http://link.springer.com/content/pdf/10.1007/978-3-319-65578-9_2

Reference8 articles.

1. Using shared memory in CUDA C/C++, April 2017. https://devblogs.nvidia.com/parallelforall/using-shared-memory-cuda-cc/

2. Edwards, H.C., Trott, C., Sunderland, D.: Kokkos, a manycore device performance portability library for C++ HPC applications, March 2014. http://on-demand.gputechconf.com/gtc/2014/presentations/S4213-kokkos-manycore-device-perf-portability-library-hpc-apps.pdf

3. Grinberg, L., Bertolli, C., Haque, R.: Hands on with openmp4.5 and unified memory: developing applications for IBM’S hybrid CPU + GPU systems (part I). Submitted for IWOMP 2017

4. CUDA C/C++ programming guide - shared memory section, April 2017. http://docs.nvidia.com/cuda/cuda-c-programming-guide/#shared-memory

5. OpenMP Language Committee: OpenMP Application Program Interface, version 4.5 edn., July 2013. http://www.openmp.org/mp-documents/openmp-4.5.pdf

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Porting HPC Applications to AMD Instinct™ MI300A using Unified Memory and OpenMP®;ISC High Performance 2024 Research Paper Proceedings (39th International Conference);2024-05

2. Experimental Characterization of OpenMP Offloading Memory Operations and Unified Shared Memory Support;OpenMP: Advanced Task-Based, Device and Compiler Programming;2023

3. Toward Supporting Multi-GPU Targets via Taskloop and User-Defined Schedules;OpenMP: Portable Multi-Level Parallelism on Modern Systems;2020

4. A Case Study of Porting HPGMG from CUDA to OpenMP Target Offload;OpenMP: Portable Multi-Level Parallelism on Modern Systems;2020

5. Performance evaluation of Unified Memory with prefetching and oversubscription for selected parallel CUDA applications on NVIDIA Pascal and Volta GPUs;The Journal of Supercomputing;2019-08-20