The Statistical Physics of Learning Revisited: Typical Learning Curves in Model Scenarios-Reference-Cited by-同舟云学术

The Statistical Physics of Learning Revisited: Typical Learning Curves in Model Scenarios

Published:2021 Issue: Volume: Page:128-142
ISSN:0302-9743
Container-title:Lecture Notes in Computer Science
language:
Short-container-title:

Author:

Biehl Michael

Abstract

AbstractThe exchange of ideas between computer science and statistical physics has advanced the understanding of machine learning and inference significantly. This interdisciplinary approach is currently regaining momentum due to the revived interest in neural networks and deep learning. Methods borrowed from statistical mechanics complement other approaches to the theory of computational and statistical learning. In this brief review, we outline and illustrate some of the basic concepts. We exemplify the role of the statistical physics approach in terms of a particularly important contribution: the computation of typical learning curves in student teacher scenarios of supervised learning. Two, by now classical examples from the literature illustrate the approach: the learning of a linearly separable rule by a perceptron with continuous and with discrete weights, respectively. We address these prototypical problems in terms of the simplifying limit of stochastic training at high formal temperature and obtain the corresponding learning curves.

Publisher

Springer International Publishing

Link

https://link.springer.com/content/pdf/10.1007/978-3-030-82427-3_10

Reference32 articles.

1. Hertz, J., Krogh, A., Palmer, R.G.: Introduction to the Theory of Neural Computation. Addison-Wesley (1991)

2. Springer Series in Statistics;T Hastie,2009

3. Bishop, C.: Pattern Recognition and Machine Learning. Cambridge University Press, Cambridge (2007)

4. Goodfellow, I., Bengio, Y., Courville, A.: Deep Learning. MIT Press (2016)

5. LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521, 436–444 (2015)