Abstract
In this effort, we propose a new deep architecture utilizing residual blocks inspired by implicit discretization schemes. As opposed to standard feed-forward networks, the outputs of the proposed implicit residual blocks are defined as the fixed points of appropriately chosen nonlinear transformations. We show that this choice leads to improved stability of both forward and backward propagations, has a favorable impact on generalization power, and allows one to control the robustness of the network with only a few hyperparameters. In addition, the proposed reformulation of ResNet does not introduce new parameters and can potentially lead to a reduction in the number of required layers due to improved forward stability. Finally, we derive a memory-efficient training algorithm, propose a stochastic regularization technique, and provide numerical results in support of our findings.
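For intuition, the sketch below illustrates the fixed-point idea: whereas a standard residual block computes y = x + f(x) (an explicit-Euler-like update), an implicit block defines its output as the solution of y = x + f(y), which can be found by fixed-point iteration when f is contractive. The specific transformation (a tanh layer), the plain Picard solver, and all names here are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def phi(y, x, W, b):
    """Candidate update map for the implicit block: y = x + tanh(W y + b).
    (Illustrative choice; the paper's 'appropriately chosen' transformation
    may differ.)"""
    return x + np.tanh(W @ y + b)

def implicit_residual_block(x, W, b, tol=1e-8, max_iter=100):
    """The block output is defined implicitly as a fixed point y = phi(y, x),
    computed here by plain fixed-point (Picard) iteration."""
    y = x.copy()                      # initial guess: the block input
    for _ in range(max_iter):
        y_next = phi(y, x, W, b)
        if np.linalg.norm(y_next - y) < tol:
            break
        y = y_next
    return y

# Usage: a W with small norm keeps phi contractive, so the iteration converges.
rng = np.random.default_rng(0)
d = 4
W = 0.1 * rng.standard_normal((d, d))   # scaled down for contractivity
b = rng.standard_normal(d)
x = rng.standard_normal(d)
y = implicit_residual_block(x, W, b)
print(np.linalg.norm(y - phi(y, x, W, b)))  # ~0: y is indeed a fixed point
```

Note that the implicit definition reuses the same parameters W and b as an explicit block would, consistent with the abstract's claim that the reformulation introduces no new parameters.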