Three learning stages and accuracy–efficiency tradeoff of restricted Boltzmann machines-Reference-Cited by-同舟云学术

Three learning stages and accuracy–efficiency tradeoff of restricted Boltzmann machines

Published:2022-09-17 Issue:1 Volume:13 Page:
ISSN:2041-1723
Container-title:Nature Communications
language:en
Short-container-title:Nat Commun

Author:

Dabelow Lennart^ORCID,Ueda Masahito^ORCID

Abstract

AbstractRestricted Boltzmann Machines (RBMs) offer a versatile architecture for unsupervised machine learning that can in principle approximate any target probability distribution with arbitrary accuracy. However, the RBM model is usually not directly accessible due to its computational complexity, and Markov-chain sampling is invoked to analyze the learned probability distribution. For training and eventual applications, it is thus desirable to have a sampler that is both accurate and efficient. We highlight that these two goals generally compete with each other and cannot be achieved simultaneously. More specifically, we identify and quantitatively characterize three regimes of RBM learning: independent learning, where the accuracy improves without losing efficiency; correlation learning, where higher accuracy entails lower efficiency; and degradation, where both accuracy and efficiency no longer improve or even deteriorate. These findings are based on numerical experiments and heuristic arguments.

Funder

MEXT | Japan Society for the Promotion of Science

Publisher

Springer Science and Business Media LLC

Subject

General Physics and Astronomy,General Biochemistry, Genetics and Molecular Biology,General Chemistry,Multidisciplinary

Link

https://www.nature.com/articles/s41467-022-33126-x.pdf

Reference64 articles.

1. Ackley, D. H., Hinton, G. E. & Sejnowski, T. J. A learning algorithm for Boltzmann machines. Cogn. Sci. 9, 147 (1985).

2. Smolensky, P. Information processing in dynamical systems: foundations of harmony theory. In: Rumelhart, D. E. & McClelland J. L. (eds.) Parallel Distributed Processing: Explorations in the Microstructure of Cognition, Vol. 1, pp. 194–281 (MIT Press, 1986).

3. Hinton, G. E. & Salakhutdinov, R. R. Reducing the dimensionality of data with neural networks. Science 313, 504 (2006).

4. Gehler, P. V., Holub, A. D. & Welling, M. The rate adapting Poisson model for information retrieval and object recognition. In: Proceedings of the 23rd International Conference on Machine Learning, ICML ’06 p. 337-344 (Association for Computing Machinery, New York, NY, USA, 2006).

5. Hinton, G. E. To recognize shapes, first learn to generate images in computational neuroscience: theoretical insights into brain function. In: Cisek, P., Drew, T. & Kalaska J. F. Progress in Brain Research, Vol. 165, pp. 535–547 (Elsevier, 2007).

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. CMOS plus stochastic nanomagnets enabling heterogeneous computers for probabilistic inference and learning;Nature Communications;2024-03-27

2. Adaptive Stochastic Conjugate Gradient Optimization for Backpropagation Neural Networks;IEEE Access;2024

3. Convolution neural network and deep learning;Artificial Intelligence and Image Processing in Medical Imaging;2024

4. Zeroth, first, and second-order phase transitions in deep neural networks;Physical Review Research;2023-12-14

5. How deep is the brain? The shallow brain hypothesis;Nature Reviews Neuroscience;2023-10-27