Author:
Alcaraz Silvia R.,Laso Ruben,Lorenzo Oscar G.,Vilariño David L.,Pena Tomás F.,Rivera Francisco F.
Abstract
AbstractThis work presents a performance-oriented study of a heterogeneous application developed with Intel OneAPI to solve two well-known diffusion problems: heat diffusion and image denoising. We have explored CPU+iGPU and CPU+FPGA schemes, applying dynamic load balancing and conducting experiments on Intel DevCloud. The results demonstrate that the CPU+iGPU scheme outperforms the execution times achieved by the fastest device when the problem is sufficiently computationally demanding. We also found that the performance of the CPU+FPGA scheme is heavily affected by bandwidth limitations and specific strategies to manage memory efficiently are required. Moreover, it was demonstrated that dynamic workload balancing is crucial due to possible performance fluctuations in any of the implicated devices. In conclusion, Intel OneAPI provides a helpful tool for multi-platform development using a unique high-level language, DPC++. However, developing specific code for each platform is necessary to achieve optimal performance.
Funder
Consellerı́a de Cultura, Educación e Ordenación Universitaria
European Regional Development Fund
Ministry of Economy and Competitiveness, Government of Spain
Universidade de Santiago de Compostela
Publisher
Springer Science and Business Media LLC