Analysis of Gradient Vanishing of RNNs and Performance Comparison-Reference-Cited by-同舟云学术

Analysis of Gradient Vanishing of RNNs and Performance Comparison

Published:2021-10-25 Issue:11 Volume:12 Page:442
ISSN:2078-2489
Container-title:Information
language:en
Short-container-title:Information

Author:

Noh Seol-Hyun^ORCID

Abstract

A recurrent neural network (RNN) combines variable-length input data with a hidden state that depends on previous time steps to generate output data. RNNs have been widely used in time-series data analysis, and various RNN algorithms have been proposed, such as the standard RNN, long short-term memory (LSTM), and gated recurrent units (GRUs). In particular, it has been experimentally proven that LSTM and GRU have higher validation accuracy and prediction accuracy than the standard RNN. The learning ability is a measure of the effectiveness of gradient of error information that would be backpropagated. This study provided a theoretical and experimental basis for the result that LSTM and GRU have more efficient gradient descent than the standard RNN by analyzing and experimenting the gradient vanishing of the standard RNN, LSTM, and GRU. As a result, LSTM and GRU are robust to the degradation of gradient descent even when LSTM and GRU learn long-range input data, which means that the learning ability of LSTM and GRU is greater than standard RNN when learning long-range input data. Therefore, LSTM and GRU have higher validation accuracy and prediction accuracy than the standard RNN. In addition, it was verified whether the experimental results of river-level prediction models, solar power generation prediction models, and speech signal models using the standard RNN, LSTM, and GRUs are consistent with the analysis results of gradient vanishing.

Publisher

MDPI AG

Subject

Information Systems

Link

https://www.mdpi.com/2078-2489/12/11/442/pdf

Reference22 articles.

1. Learning representations by back-propagating errors

2. Finding Structure in Time

3. Generalization of backpropagation with application to a recurrent gas market model

4. Deep learning

Cited by 51 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Efficient and Robust Arabic Automotive Speech Command Recognition System;Algorithms;2024-09-02

2. Optimizing Continuous Casting through Cyber–Physical System;Processes;2024-08-20

3. PmForecast: leveraging temporal LSTM to deliver in situ air quality predictions;Environmental Science and Pollution Research;2024-08-10

4. Anomaly Detection on Natural Gas Pipeline Operational Data Using GRU Method;2024 International Conference on Data Science and Its Applications (ICoDSA);2024-07-10

5. Precision forecasting of grinding wheel Wear: A TransBiGRU model for advanced industrial predictive maintenance;Measurement;2024-07