Author:
Solomonik Edgar,Demmel James
Publisher
Springer Berlin Heidelberg
Reference20 articles.
1. Agarwal, R.C., Balle, S.M., Gustavson, F.G., Joshi, M., Palkar, P.: A three-dimensional approach to parallel matrix multiplication. IBM J. Res. Dev. 39, 575–582 (1995)
2. Aggarwal, A., Chandra, A.K., Snir, M.: Communication complexity of PRAMs. Theoretical Computer Science 71(1), 3–28 (1990)
3. Ashcraft, C.: A taxonomy of distributed dense LU factorization methods. Boeing Computer Services Technical Report ECA-TR-161 (March 1991)
4. Ashcraft, C.: The fan-both family of column-based distributed Cholesky factorization algorithms. In: Alan George, J.R.G., Liu, J.W.H. (eds.) Graph Theory and Sparse Matrix Computation. IMA Volumes in Mathematics and its Applications, vol. 56, pp. 159–190. Springer, Heidelberg (1993)
5. Ballard, G., Demmel, J., Holtz, O., Schwartz, O.: Minimizing communication in numerical linear algebra. To appear in SIAM J. Mat. Anal. Appl., UCB Technical Report EECS-2009-62 (2010)
Cited by
129 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. AutoDDL: Automatic Distributed Deep Learning With Near-Optimal Bandwidth Cost;IEEE Transactions on Parallel and Distributed Systems;2024-08
2. Alternative Basis Matrix Multiplication is Fast and Stable;2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS);2024-05-27
3. Parallel and Distributed Graph Neural Networks: An In-Depth Concurrency Analysis;IEEE Transactions on Pattern Analysis and Machine Intelligence;2024-05
4. 2D-SAZD: A Novel 2D Coded Distributed Computing Framework for Matrix-Matrix Multiplication;IEEE Transactions on Services Computing;2024-05
5. Fast Kronecker Matrix-Matrix Multiplication on GPUs;Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming;2024-02-20