Authors:
Meng Pinchao, Wang Xinyu, Yin Weishi
Abstract
<abstract><p>The core contribution of this paper is to interpret the forward propagation process of machine learning as a parameter estimation problem for nonlinear dynamical systems. By establishing a connection between recurrent neural networks and discrete differential equations, we construct a new network structure: the ordinary differential equation recurrent unit (ODE-RU). Inspired by the theory of ordinary differential equations, we also propose a new forward propagation mode. In extensive simulations and experiments, this forward propagation not only demonstrates the trainability of the new architecture, but also achieves a low training error while maintaining the stability of the network. For problems requiring long-term memory, we study the obstacle shape reconstruction problem using a backscattering far-field feature data set, and demonstrate the effectiveness of the proposed architecture on this data set. The results show that the network effectively reduces sensitivity to small changes in the input features, and the error of the ODE-RU network in inverting the shape and position of obstacles is less than $ 10^{-2} $.</p></abstract>
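The abstract does not spell out the ODE-RU update rule, so the following is only a hedged illustration of the general idea it describes: treating a recurrent step as a forward-Euler discretization of an ordinary differential equation, so that the hidden state evolves as $h_{t+1} = h_t + \varepsilon\,\tanh(W h_t + U x_t + b)$. The function name, dimensions, and step size $\varepsilon$ here are assumptions for the sketch, not the paper's actual architecture.

```python
import numpy as np

def euler_rnn_step(h, x, W, U, b, eps=0.1):
    """One forward-Euler step of an ODE-inspired recurrent unit:
    h_{t+1} = h_t + eps * tanh(W @ h_t + U @ x_t + b).
    A small eps keeps each update a small perturbation of the state,
    which is the stability intuition behind ODE-based RNN designs."""
    return h + eps * np.tanh(W @ h + U @ x + b)

# Tiny usage sketch: unroll the unit over a short random input sequence.
rng = np.random.default_rng(0)
d_h, d_x, T = 4, 3, 5                       # hidden size, input size, sequence length
W = rng.standard_normal((d_h, d_h)) * 0.1   # recurrent weights
U = rng.standard_normal((d_h, d_x)) * 0.1   # input weights
b = np.zeros(d_h)                           # bias

h = np.zeros(d_h)                           # initial hidden state
for x in rng.standard_normal((T, d_x)):
    h = euler_rnn_step(h, x, W, U, b)
print(h.shape)  # (4,)
```

Because the increment is scaled by `eps`, small input perturbations produce correspondingly small state perturbations, which matches the reduced input sensitivity the abstract reports.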
Publisher
American Institute of Mathematical Sciences (AIMS)
Cited by 4 articles.