Large-scale distributed linear algebra with tensor processing units-Reference-Cited by-同舟云学术

Large-scale distributed linear algebra with tensor processing units

Published:2022-08-08 Issue:33 Volume:119 Page:
ISSN:0027-8424
Container-title:Proceedings of the National Academy of Sciences
language:en
Short-container-title:Proc. Natl. Acad. Sci. U.S.A.

Author:

Lewis Adam G. M.¹²^ORCID,Beall Jackson¹²,Ganahl Martin¹²,Hauru Markus²^ORCID,Mallick Shrestha Basu²,Vidal Guifre²³

Affiliation:

1. Simulation & Optimization Team, Sandbox AQ, Palo Alto, CA 94301;

2. Sandbox Alphabet X, The Moonshot Factory, Mountain View, CA 94043;

3. Google Quantum AI, Google LLC, Santa Barbara, CA 93111

Abstract

We have repurposed Google tensor processing units (TPUs), application-specific chips developed for machine learning, into large-scale dense linear algebra supercomputers. The TPUs’ fast intercore interconnects (ICIs), physically two-dimensional network topology, and high-bandwidth memory (HBM) permit distributed matrix multiplication algorithms to rapidly become computationally bound. In this regime, the matrix-multiply units (MXUs) dominate the runtime, yielding impressive scaling, performance, and raw size: Operating in float32 precision, a full 2,048-core pod of third-generation TPUs can multiply two matrices with linear sizeN=220=1,048,576in about 2 min. Via curated algorithms emphasizing large, single-core matrix multiplications, other tasks in dense linear algebra can similarly scale. As examples, we present 1) QR decomposition; 2) resolution of linear systems; and 3) the computation of matrix functions by polynomial iteration, demonstrated by the matrix polar factorization.

Publisher

Proceedings of the National Academy of Sciences

Subject

Multidisciplinary

Link

https://pnas.org/doi/pdf/10.1073/pnas.2122762119

Reference25 articles.

1. Learning data-driven discretizations for partial differential equations

2. Machine learning guided aptamer refinement and discovery

3. Machine learning–accelerated computational fluid dynamics

4. Kohn-Sham Equations as Regularizer: Building Prior Knowledge into Machine-Learned Physics

5. T. Lu, Y. F. Chen, B. Hechtman, T. Wang, J. Anderson, Large-scale discrete Fourier transform on TPUs. arXiv [Preprint] (2020). https://doi.org/10.48550/arXiv.2002.03260. Accessed 26 July 2022.

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Solving the discretised neutron diffusion equations using neural networks;International Journal for Numerical Methods in Engineering;2023-07-12

2. Seismic imaging of medical ultrasound data: Towards in vivo applications;Europhysics Letters;2023-05-30

3. Fast time evolution of matrix product states using the QR decomposition;Physical Review B;2023-04-21

4. Accelerated linear algebra compiler for computationally efficient numerical models: Success and potential area of improvement;PLOS ONE;2023-02-24

5. Density Matrix Renormalization Group with Tensor Processing Units;PRX Quantum;2023-02-16