Training with Noise is Equivalent to Tikhonov Regularization-Reference-Cited by-同舟云学术

Training with Noise is Equivalent to Tikhonov Regularization

Published:1995-01 Issue:1 Volume:7 Page:108-116
ISSN:0899-7667
Container-title:Neural Computation
language:en
Short-container-title:Neural Computation

Author:

Bishop Chris M.¹

Affiliation:

1. Neural Computing Research Group, Department of Computer Science, Aston University, Birmingham, B4 7ET, U.K.

Abstract

It is well known that the addition of noise to the input data of a neural network during training can, in some circumstances, lead to significant improvements in generalization performance. Previous work has shown that such training with noise is equivalent to a form of regularization in which an extra term is added to the error function. However, the regularization term, which involves second derivatives of the error function, is not bounded below, and so can lead to difficulties if used directly in a learning algorithm based on error minimization. In this paper we show that for the purposes of network training, the regularization term can be reduced to a positive semi-definite form that involves only first derivatives of the network mapping. For a sum-of-squares error function, the regularization term belongs to the class of generalized Tikhonov regularizers. Direct minimization of the regularized error function provides a practical alternative to training with noise.

Publisher

MIT Press - Journals

Subject

Cognitive Neuroscience,Arts and Humanities (miscellaneous)

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/neco.1995.7.1.108

Reference16 articles.

1. Improving the Generalization Properties of Radial Basis Function Neural Networks

2. Exact Calculation of the Hessian Matrix for the Multilayer Perceptron

3. Curvature-driven smoothing: a learning algorithm for feedforward networks

4. Neural Networks and the Bias/Variance Dilemma

Cited by 714 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Unifying mixed gas adsorption in molecular sieve membranes and MOFs using machine learning;Separation and Purification Technology;2025-01

2. Physics-driven deep-learning for marine CSEM data inversion;Journal of Applied Geophysics;2024-10

3. Data generation for exploration geochemistry: Past, present and future;Applied Geochemistry;2024-10

4. Chaotic computing cell based on nanostructured phase-change materials;Journal of Computational Electronics;2024-09-13

5. Predicting binary neutron star postmerger spectra using artificial neural networks;Physical Review D;2024-09-05