A Scaled Conjugate Gradient Algorithm for Fast Supervised Learning

Published:1990-11-01 Issue:339 Volume:19
ISSN:2245-9316
Container-title:DAIMI Report Series
Short-container-title:DPB

Author:

Møller Martin F.

Abstract

A supervised learning algorithm (Scaled Conjugate Gradient, SCG) with superlinear convergence rate is introduced. The algorithm is based upon a class of optimization techniques well known in numerical analysis as the Conjugate Gradient Methods. SCG uses second order information from the neural network but requires only O(N) memory usage, where N is the number of weights in the network. The performance of SCG is benchmarked against the performance of the standard backpropagation algorithm (BP), the conjugate gradient backpropagation (CGB) and the one-step Broyden-Fletcher-Goldfarb-Shanno memoryless quasi-Newton algorithm (BFGS). SCG yields a speed-up of at least an order of magnitude relative to BP. The speed-up depends on the convergence criterion, i.e., the bigger demand for reduction in error the bigger the speed-up. SCG is fully automated including no user dependent parameters and avoids a time consuming line-search, which CGB and BFGS use in each iteration in order to determine an appropriate step size. Incorporating problem dependent structural information in the architecture of a neural network often lowers the overall complexity. The smaller the complexity of the neural network relative to the problem domain, the bigger the possibility that the weight space contains long ravines characterized by sharp curvature. While BP is inefficient on these ravine phenomena, it is shown that SCG handles them effectively.
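
The abstract's two central claims, second-order information in O(N) memory and a step size obtained without a line search, can be illustrated with a minimal sketch. It assumes the curvature along a search direction p is estimated from one extra gradient evaluation, and that the step size is the ratio of the directional slope to that curvature. The toy error function, the helper names (`grad`, `curvature_along`, `step_without_line_search`) and the value of `sigma` are illustrative assumptions, not taken from the report. The sketch also leaves out the scaling term that keeps the curvature estimate positive on non-quadratic error surfaces, so it is not a full SCG implementation.

```python
def grad(w):
    # Gradient of a toy error E(w) = 0.5 * (w0^2 + 100 * w1^2),
    # a narrow ravine with very different curvature per axis (assumed example).
    return [w[0], 100.0 * w[1]]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def curvature_along(grad_fn, w, p, sigma=1e-4):
    """Approximate the Hessian-vector product H p by a finite difference.

    Only gradient-sized vectors are held in memory; the N x N Hessian
    is never formed.
    """
    sigma_k = sigma / dot(p, p) ** 0.5
    g0 = grad_fn(w)
    g1 = grad_fn([wi + sigma_k * pi for wi, pi in zip(w, p)])
    return [(b - a) / sigma_k for a, b in zip(g0, g1)]


def step_without_line_search(grad_fn, w, p):
    """Take one step along p with a step size computed from curvature."""
    r = [-g for g in grad_fn(w)]        # steepest-descent direction at w
    s = curvature_along(grad_fn, w, p)  # approximately H p
    delta = dot(p, s)                   # curvature along p
    mu = dot(p, r)                      # negative slope along p
    alpha = mu / delta                  # minimizer of the local quadratic model
    return [wi + alpha * pi for wi, pi in zip(w, p)], alpha


w0 = [1.0, 1.0]
p0 = [-g for g in grad(w0)]             # first search direction
w1, alpha = step_without_line_search(grad, w0, p0)
```

On this quadratic example the computed step lands at the minimum along p0, so the gradient at w1 is orthogonal to p0 up to finite-difference error. A line-search method would need several additional error evaluations to find the same point.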

Publisher

Det Kgl. Bibliotek/Royal Danish Library

Cited by 29 articles.

1. Prediction of residual stresses in welded structures based on neural network: a review;Journal of Materials Science;2024-09-13

2. Artificial Neural Networks and Discrete Choice Models: Comparing and Contrasting;Smart Innovation, Systems and Technologies;2024

3. Artificial Neural Networks and Discrete Choice Models;Management and Marketing for Improved Retail Competitiveness and Performance;2023-06-30

4. Application of soft computing approaches for modeling annular pressure loss of slim-hole wells in one of Iranian central oil fields;Soft Computing;2023-03-09

5. Centroid-Based Differential Evolution with Composite Trial Vector Generation Strategies for Neural Network Training;Applications of Evolutionary Computation;2023