Analysis of Task Offloading for Accelerators

Author:

Ferrer Roger,Beltran Vicenç,Gonzàlez Marc,Martorell Xavier,Ayguadé Eduard

Publisher

Springer Berlin Heidelberg

Reference32 articles.

1. Chen, T., Raghavan, R., Dale, J., Iwata, E.: Cell Broadband Engine Architecture and its first implementation. IBM Developer Works (November 2005)

2. NVIDIA corporation: NVIDIA CUDA Compute Unified Device Architecture Version 2.0 (2008)

3. NVIDIA corporation: NVIDIA Tesla GPU Computing Technical Brief (2008)

4. OpenMP Architecture Review Board: OpenMP Application Program Interface. Version 3.0 (May 2008), http://www.openmp.org

5. Ayguadé, E., Copty, N., Duran, A., Hoeflinger, J., Lin, Y., Massaioli, F., Teruel, X., Unnikrishnan, P., Zhang, G.: The Design of OpenMP Tasks. IEEE Transactions on Parallel and Distributed Systems 20(3), 404–418 (2009)

Cited by 5 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Frustrated With MPI+Threads? Try MPIxThreads!;Proceedings of the 30th European MPI Users' Group Meeting;2023-09-11

2. A Vector-Length Agnostic Compiler for the Connex-S Accelerator with Scratchpad Memory;ACM Transactions on Embedded Computing Systems;2020-11-30

3. Polyhedral parallel code generation for CUDA;ACM Transactions on Architecture and Code Optimization;2013-01

4. OpenMP Extensions for Heterogeneous Architectures;OpenMP in the Petascale Era;2011

5. Parallel Programming Models for Heterogeneous Multicore Architectures;IEEE Micro;2010-09

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3