Publisher
Zhejiang University Press
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing
Reference16 articles.
1. Alfieri RA, 1994. An efficient kernel-based implementation of POSIX threads. Proc USENIX Summer Technical Conf, p.59–72.
2. Arevalo A, Matinata RM, Pandian M, et al., 2000. Programming the cell broadband engine examples and best practices. ACM Workshop. Available from https://www.autodesk.com/research/publications/programming-the-cell-broadband [Accessed on Aug. 25, 2022].
3. Fang JB, Varbanescu AL, Sips H, 2011. A comprehensive performance comparison of CUDA and OpenCL. Int Conf on Parallel Processing, p.216–225. https://doi.org/10.1109/ICPP.2011.45
4. Fang JB, Huang C, Tang T, et al., 2020. Parallel programming models for heterogeneous many-cores: a comprehensive survey. CCF Trans High Perform Comput, 2(4):382–400. https://doi.org/10.1007/s42514-020-00039-4
5. Jääskeläinen P, de la Lama CS, Schnetter E, et al., 2015. pocl: a performance-portable OpenCL implementation. Int J Parall Program, 43(5):752–785. https://doi.org/10.1007/s10766-014-0320-y
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Optimizing Stencil Computation on Multi-core DSPs;Proceedings of the 53rd International Conference on Parallel Processing;2024-08-12
2. Optimizing General Matrix Multiplications on Modern Multi-core DSPs;2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS);2024-05-27
3. thSORT: an efficient parallel sorting algorithm on multi-core DSPs;CCF Transactions on High Performance Computing;2024-01-19
4. Efficiently Running SpMV on Multi-core DSPs for Banded Matrix;Lecture Notes in Computer Science;2024
5. Parallel Implementation of SHA256 on Multizone Heterogeneous Systems;2023 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom);2023-12-21