Statistical Mechanics of Deep Learning-Reference-Cited by-同舟云学术

Statistical Mechanics of Deep Learning

Published:2020-03-10 Issue:1 Volume:11 Page:501-528
ISSN:1947-5454
Container-title:Annual Review of Condensed Matter Physics
language:en
Short-container-title:Annu. Rev. Condens. Matter Phys.

Author:

Bahri Yasaman¹,Kadmon Jonathan²,Pennington Jeffrey¹,Schoenholz Sam S.¹,Sohl-Dickstein Jascha¹,Ganguli Surya¹²

Affiliation:

1. Google Brain, Google Inc., Mountain View, California 94043, USA

2. Department of Applied Physics, Stanford University, Stanford, California 94035, USA;

Abstract

The recent striking success of deep neural networks in machine learning raises profound questions about the theoretical principles underlying their success. For example, what can such deep networks compute? How can we train them? How does information propagate through them? Why can they generalize? And how can we teach them to imagine? We review recent work in which methods of physical analysis rooted in statistical mechanics have begun to provide conceptual insights into these questions. These insights yield connections between deep learning and diverse physical and mathematical topics, including random landscapes, spin glasses, jamming, dynamical phase transitions, chaos, Riemannian geometry, random matrix theory, free probability, and nonequilibrium statistical mechanics. Indeed, the fields of statistical mechanics and machine learning have long enjoyed a rich history of strongly coupled interactions, and recent advances at the intersection of statistical mechanics and deep learning suggest these interactions will only deepen going forward.

Publisher

Annual Reviews

Subject

Condensed Matter Physics,General Materials Science

Link

https://www.annualreviews.org/doi/pdf/10.1146/annurev-conmatphys-031119-050745

Cited by 128 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Data science education in undergraduate physics: Lessons learned from a community of practice;American Journal of Physics;2024-09-01

2. Teaching research of probability theory and mathematical statistics from the perspective of artificial intelligence;Proceedings of the 2024 Guangdong-Hong Kong-Macao Greater Bay Area International Conference on Education Digitalization and Computer Science;2024-07-26

3. Weight fluctuations in deep linear neural networks and a derivation of the inverse-variance flatness relation;Physical Review Research;2024-07-25

4. How Deep Neural Networks Learn Compositional Data: The Random Hierarchy Model;Physical Review X;2024-07-01

5. Characterization of overparametrization in the simulation of realistic quantum systems;Physical Review A;2024-06-14