Affiliation:
1. Department of Theoretical Physics, University of Lund, Sölvegatan 14 A, S-223 62 Lund, Sweden
Abstract
The Langevin updating rule, in which noise is added to the weights during learning, is presented and shown to improve learning on problems with initially ill-conditioned Hessians. This is particularly important for multilayer perceptrons with many hidden layers, that often have ill-conditioned Hessians. In addition, Manhattan updating is shown to have a similar effect.
Subject
Cognitive Neuroscience,Arts and Humanities (miscellaneous)
Cited by
37 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献