Accelerating Sparse Deep Neural Network Inference Using GPU Tensor Cores

Published: 2022-09-19
Container-title: 2022 IEEE High Performance Extreme Computing Conference (HPEC)

Author:

Sun Yufei¹, Zheng Long¹, Wang Qinggang¹, Ye Xiangyu¹, Huang Yu¹, Yao Pengcheng¹, Liao Xiaofei¹, Jin Hai¹

Affiliation:

1. School of Computer Science and Technology, Huazhong University of Science and Technology, National Engineering Research Center for Big Data Technology and System/Services Computing Technology and System Lab/Cluster and Grid Computing Laboratory, Wuhan, China, 430074

Publisher

IEEE

Link

http://xplorestaging.ieee.org/ielx7/9926284/9926287/09926300.pdf?arnumber=9926300

Cited by 6 articles.

1. Accelerating ML Workloads using GPU Tensor Cores: The Good, the Bad, and the Ugly; Proceedings of the 15th ACM/SPEC International Conference on Performance Engineering; 2024-05-07

2. DTC-SpMM: Bridging the Gap in Accelerating General Sparse Matrix Multiplication with Tensor Cores; Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3; 2024-04-27

3. Mixed-Precision S/DGEMM Using the TF32 and TF64 Frameworks on Low-Precision AI Tensor Cores; Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis; 2023-11-12

4. DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multiplication; Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis; 2023-11-11

5. GLARE: Accelerating Sparse DNN Inference Kernels with Global Memory Access Reduction; 2023 IEEE High Performance Extreme Computing Conference (HPEC); 2023-09-25