CWLP: coordinated warp scheduling and locality-protected cache allocation on GPUs-Reference-Cited by-同舟云学术

CWLP: coordinated warp scheduling and locality-protected cache allocation on GPUs

Published:2018-02 Issue:2 Volume:19 Page:206-220
ISSN:2095-9184
Container-title:Frontiers of Information Technology & Electronic Engineering
language:en
Short-container-title:Frontiers Inf Technol Electronic Eng

Author:

Zhang Yang^ORCID,Xing Zuo-cheng,Liu Cang,Tang Chuan

Funder

National Natural Science Foundation of China

the Specialized Research Fund for the Doctoral Program of Higher Education, China

Publisher

Zhejiang University Press

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing

Link

http://link.springer.com/article/10.1631/FITEE.1700059/fulltext.html

Reference27 articles.

1. Bakhoda A, Yuan G, Fung W, et al., 2009. Analyzing CUDA workloads using a detailed GPU simulator. ISPASS IEEE Int Symp on Performance Analysis of Systems and Software, p.163–174. https://doi.org/10.1109/ISPASS.2009.4919648

2. Che S, Boyer M, Meng J, et al., 2009. Rodinia: a benchmark suite for heterogeneous computing. IISWC IEEE Int Symp on Workload Characterization, p.44–54. https://doi.org/10.1109/IISWC.2009.5306797

3. Chen J, Tao X, Yang Z, et al., 2013. Guided region-based GPU scheduling: utilizing multi-thread parallelism to hide memory latency. IEEE 27th Int Symp on Parallel & Distributed Processing, p.441–451. https://doi.org/10.1109/IPDPS.2013.95

4. Chen X, Chang L, Rodrigues C, et al., 2014. Adaptive cache management for energy-efficient GPU computing. Proc 47th Annual IEEE/ACM Int Symp on Microarchitecture, p.343–355. https://doi.org/10.1109/MICRO.2014.11

5. Dally W, Labonte F, Das A, et al., 2003. Merrimac: supercomputing with streams. Proc ACM/IEEE Conf on Supercomputing, Article 35. https://doi.org/10.1145/1048935.1050187

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. POTDP: Research GPU Performance Optimization Method based on Thread Dynamic Programming;2022 IEEE 4th International Conference on Power, Intelligent Computing and Systems (ICPICS);2022-07-29

2. A Survey of GPGPU Parallel Processing Architecture Performance Optimization;2021 IEEE/ACIS 20th International Fall Conference on Computer and Information Science (ICIS Fall);2021-10-13

3. A novel warp scheduling scheme considering long-latency operations for high-performance GPUs;The Journal of Supercomputing;2019-11-23