Author:
Barik Rajkishore,Shpeisman Tatiana,Rong Hongbo,Hu Chunling,Lee Victor W.,Anderson Todd A.,Henry Greg,Liu Hai,Wu Youfeng,Petersen Paul,Lowney Geoff
Publisher
Springer International Publishing
Reference46 articles.
1. Effective Use of the Intel Compiler’s Offload Features.
https://software.intel.com/en-us/articles/effective-use-of-the-intel-compilers-offload-features
2. How to Overlap Data Transfers in CUDA C/C++.
https://devblogs.nvidia.com/parallelforall/how-overlap-data-transfers-cuda-cc/
3. Intel Math Kernel Library Automatic Offload for Intel Xeon Phi Coprocessor.
https://software.intel.com/en-us/articles/math-kernel-library-automatic-offload-for-intel-xeon-phi-coprocessor
4. AlSaber, N., Kulkarni, M.: Semcache: semantics-aware caching for efficient GPU offloading. In: Proceedings of the 27th International ACM Conference on International Conference on Supercomputing, ICS 2013, pp. 421–432. ACM, New York (2013)
5. Augonnet, C., Thibault, S., Namyst, R., Wacrenier, P.-A.: StarPU: a unified platform for task scheduling on heterogeneous multicore architectures. Concurr. Comput.: Pract. Exp. 23(2), 187–198 (2011)