Abstract
Abstract
GPGPU (General Purpose Computing on Graphics Processing Units) has been widely applied to high performance computing. However, GPU architecture and programming model are different from that of traditional CPU. Accordingly, it is rather challenging to develop efficient GPU applications. This paper focuses on the key techniques of programming model and compiler optimization for many-core GPU, and addresses a number of key theoretical and technical issues. This paper proposes a many-threaded programming model ab-Stream, which would transparentize architecture differences and provide an easy to parallel, easy to program, easy to extend and easy to tune programming model. In addition, this paper proposes memory optimization and data transfer transformation according to data classification. Firstly, this paper proposes data layout pruning based on classification memory, and then proposes Ta T (Transfer after Transformed) for transferring Strided data between CPU and GPU. Experimental results demonstrate that proposed techniques would significantly improve performance for GPGPU applications.
Subject
General Physics and Astronomy
Reference15 articles.
1. Accelerating the SCE-UA Global Optimization Method Based on Multi-Core CPU and Many-Core GPU;Kan;Advances in Meteorology,2016
2. High performance in silico virtual drug screening on many-core processors;Simon;International Journal of High Performance Computing Applications,2015
3. GPU accelerated generation of digitally reconstructed radiographs for 2-D/3-D image registration;Dorgham;IEEE Transactions on Biomedical Engineering,2012
4. Mapping high-fidelity volume rendering for medical imaging to CPU, GPU and many-core architectures;Smelyanskiy;IEEE Transactions on Visualization and Computer Graphics,2009
5. Designing fast architecture-sensitive tree search on modern multicore/many-core processors;Changkyu;ACM Transactions on Database Systems (TODS),2011
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Compiling with MultiCores;2023 2nd International Conference for Innovation in Technology (INOCON);2023-03-03
2. A Novel Optimization for GPU Mining Using Overclocking and Undervolting;Sustainability;2022-07-16