Efficient Quantized Sparse Matrix Operations on Tensor Cores

Published: 2022-11
Container-title: SC22: International Conference for High Performance Computing, Networking, Storage and Analysis

Author:

Li Shigang¹, Osawa Kazuki², Hoefler Torsten²

Affiliation:

1. School of Computer Science, Beijing University of Posts and Telecommunications, Beijing, China

2. Department of Computer Science, ETH Zurich, Zurich, Switzerland

Publisher

IEEE

References: 77 articles.

2. Bayesian Bits: Unifying Quantization and Pruning; van Baalen; Advances in Neural Information Processing Systems, 2020

Cited by 7 articles.

1. Two-Face: Combining Collective and One-Sided Communication for Efficient Distributed SpMM; Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2; 2024-04-27

3. A Row Decomposition-based Approach for Sparse Matrix Multiplication on GPUs; Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming; 2024-02-20

5. DASP: Specific Dense Matrix Multiply-Accumulate Units Accelerated General Sparse Matrix-Vector Multiplication; Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis; 2023-11-11