Affiliation:
1. Department of Electrical and Computer Engineering, McMaster University, Hamilton, Ontario, L8S 4L7, Canada
Abstract
We consider the problem of training a linear feedforward neural network with a gradient-descent-like LMS learning algorithm. The objective is to find a weight matrix for the network, by repeatedly presenting a finite set of examples to it, so that the sum of the squares of the errors is minimized. Kohonen showed that with a small but fixed learning rate (or stepsize), some subsequences of the weight matrices generated by the algorithm converge to certain matrices close to the optimal weight matrix. In this paper, we show that, by dynamically decreasing the learning rate during each training cycle, the sequence of matrices generated by the algorithm converges to the optimal weight matrix. We also show that for any given ∊ > 0 the LMS algorithm, with decreasing learning rates, generates an ∊-optimal weight matrix (i.e., a matrix at distance at most ∊ from the optimal matrix) after O(1/∊) training cycles. This is in contrast to the Ω((1/∊) log(1/∊)) training cycles needed to generate an ∊-optimal weight matrix when the learning rate is kept fixed. We also give a general condition on the learning rates under which the LMS learning algorithm is guaranteed to converge to the optimal weight matrix.
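To illustrate the setting the abstract describes, the following is a minimal sketch (not the paper's exact algorithm or analysis) of LMS training of a linear network on a fixed example set, comparing a small fixed learning rate with a learning rate that decreases across training cycles. The synthetic data, the schedule eta_k = eta0/k, and all constants are illustrative assumptions.

```python
import numpy as np

# Illustrative sketch only: LMS training of a linear network y = W x on a
# finite example set. The data, stepsizes, and the 1/k decay schedule are
# assumptions for demonstration, not the paper's construction.

rng = np.random.default_rng(0)
n_in, n_out, n_examples = 4, 2, 20
X = rng.standard_normal((n_examples, n_in))
W_true = rng.standard_normal((n_out, n_in))
Y = X @ W_true.T + 0.1 * rng.standard_normal((n_examples, n_out))

# Least-squares optimal weight matrix (minimizer of the sum of squared errors).
W_opt = np.linalg.lstsq(X, Y, rcond=None)[0].T

def lms_train(cycles, eta_schedule):
    """Repeatedly present the example set; apply one LMS update per example."""
    W = np.zeros((n_out, n_in))
    for k in range(1, cycles + 1):
        eta = eta_schedule(k)            # learning rate for this training cycle
        for x, y in zip(X, Y):
            err = y - W @ x              # per-example error
            W += eta * np.outer(err, x)  # LMS (stochastic gradient) step
    return W

W_fixed = lms_train(500, lambda k: 0.01)      # small fixed stepsize
W_decay = lms_train(500, lambda k: 0.05 / k)  # stepsize decreasing each cycle

print("||W_fixed - W_opt|| =", np.linalg.norm(W_fixed - W_opt))
print("||W_decay - W_opt|| =", np.linalg.norm(W_decay - W_opt))
```

With the fixed stepsize, the iterates settle near, but not at, the least-squares solution; with the decaying schedule, the distance to the optimal weight matrix continues to shrink, which is the behavior the paper's convergence and rate results formalize.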
Subject
Cognitive Neuroscience, Arts and Humanities (miscellaneous)
Cited by
81 articles.