Author:
Demmel James,Eliahu David,Fox Armando,Kamil Shoaib,Lipshitz Benjamin,Schwartz Oded,Spillinger Omer
Cited by
79 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. AutoDDL: Automatic Distributed Deep Learning With Near-Optimal Bandwidth Cost;IEEE Transactions on Parallel and Distributed Systems;2024-08
2. Fault-Tolerant Parallel Integer Multiplication;Proceedings of the 36th ACM Symposium on Parallelism in Algorithms and Architectures;2024-06-17
3. Fast Kronecker Matrix-Matrix Multiplication on GPUs;Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming;2024-02-20
4. Communication Lower Bounds and Optimal Algorithms for Multiple Tensor-Times-Matrix Computation;SIAM Journal on Matrix Analysis and Applications;2024-02-06
5. Pebbling Game and Alternative Basis for High Performance Matrix Multiplication;SIAM Journal on Scientific Computing;2023-11-15