Free launch-Reference-Cited by-同舟云学术

Free launch

Published:2015-12-05 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the 48th International Symposium on Microarchitecture
language:
Short-container-title:

Author:

Chen Guoyang¹,Shen Xipeng¹

Affiliation:

1. North Carolina State University, Raleigh, NC

Funder

Google Faculty Award

Nvidia

National Science Foundation

DOE Early Career Award

IBM CAS Fellowship

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/2830772.2830818

Reference28 articles.

1. M. Kulkarni M. Burtscher K. Pingali and C. Cascaval "Lonestar: A suite of parallel irregular programs " in Proceedings of IEEE International Symposium on Performance Analysis of Systems and Software 2009. M. Kulkarni M. Burtscher K. Pingali and C. Cascaval "Lonestar: A suite of parallel irregular programs " in Proceedings of IEEE International Symposium on Performance Analysis of Systems and Software 2009.

2. S. Jones "Introduction to dynamic parallelism " in Nvidia GPU Technology Conference (San Jose CA) May 2012. S. Jones "Introduction to dynamic parallelism " in Nvidia GPU Technology Conference (San Jose CA) May 2012.

3. "OpenCL." http://www.khronos.org/opencl/. "OpenCL." http://www.khronos.org/opencl/.

4. J. Wang and S. Yalamanchili "Characterization and analysis of dynamic parallelism in unstructured gpu applications " in 2014 IEEE International Symposium on Workload Characterization October 2014. J. Wang and S. Yalamanchili "Characterization and analysis of dynamic parallelism in unstructured gpu applications " in 2014 IEEE International Symposium on Workload Characterization October 2014.

5. J. Wang N. Rubin A. Sidelnik and S. Yalamanchili "Dynamic thread block launch: A lightweight execution mechanism to support irregular applications on gpus " in Proceeding of the 42nd Annual International Symposium on Computer Architecuture (ISCA-42) June 2015. 10.1145/2749469.2750393 J. Wang N. Rubin A. Sidelnik and S. Yalamanchili "Dynamic thread block launch: A lightweight execution mechanism to support irregular applications on gpus " in Proceeding of the 42nd Annual International Symposium on Computer Architecuture (ISCA-42) June 2015. 10.1145/2749469.2750393

Cited by 35 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. BLP: Block-Level Pipelining for GPUs;Proceedings of the 21st ACM International Conference on Computing Frontiers;2024-05-07

2. BTO, Block and Thread Optimization of GPU Kernels on Geophysical Exploration;2024 32nd Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP);2024-03-20

3. Paella: Low-latency Model Serving with Software-defined GPU Scheduling;Proceedings of the 29th Symposium on Operating Systems Principles;2023-10-23

4. Optimization Techniques for GPU Programming;ACM Computing Surveys;2023-03-16

5. A Compiler Framework for Optimizing Dynamic Parallelism on GPUs;2022 IEEE/ACM International Symposium on Code Generation and Optimization (CGO);2022-04-02