Affiliation:
1. Department of Computer Science, University of Houston, Houston, TX 77004, USA
Abstract
Existing studies show that using single GPU can lead to obtaining significant performance gains. We should be able to achieve further performance speedup if we use more than one GPU. Heterogeneous processors consisting of multiple CPUs and GPUs offer immense potential and are often considered as a leading candidate for porting complex scientific applications. Unfortunately programming heterogeneous systems requires more effort than what is required for traditional multicore systems. Directive-based programming approaches are being widely adopted since they make it easy to use/port/maintain application code. OpenMP and OpenACC are two popular models used to port applications to accelerators. However, neither of the models provides support for multiple GPUs. A plausible solution is to use combination of OpenMP and OpenACC that forms a hybrid model; however, building this model has its own limitations due to lack of necessary compilers’ support. Moreover, the model also lacks support for direct device-to-device communication. To overcome these limitations, an alternate strategy is to extend OpenACC by proposing and developing extensions that follow a task-based implementation for supporting multiple GPUs. We critically analyze the applicability of the hybrid model approach and evaluate the proposed strategy using several case studies and demonstrate their effectiveness.
Subject
Computer Science Applications,Software
Cited by
11 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Accelerated CFD computations on multi-GPU using OpenMP and OpenACC;Sādhanā;2024-02-22
2. Accelerated CFD Computations on Multi-GPU Using OpenMP and OpenACC;Lecture Notes in Mechanical Engineering;2024
3. Data mapping strategies for multi-GPU implementation of a seismic application;Anais do XXIV Simpósio em Sistemas Computacionais de Alto Desempenho (SSCAD 2023);2023-10-17
4. Feasibility Studies in Multi-GPU Target Offloading;OpenMP in a Modern World: From Multi-device Support to Meta Programming;2022
5. Performance Characteristics of Virtualized GPUs for Deep Learning;2020 IEEE/ACM International Workshop on Interoperability of Supercomputing and Cloud Technologies (SuperCompCloud);2020-11