Neural network approximation-Reference-Cited by-同舟云学术

Neural network approximation

Published:2021-05 Issue: Volume:30 Page:327-444
ISSN:0962-4929
Container-title:Acta Numerica
language:en
Short-container-title:Acta Numerica

Author:

DeVore Ronald,Hanin Boris,Petrova Guergana

Abstract

Neural networks (NNs) are the method of choice for building learning algorithms. They are now being investigated for other numerical tasks such as solving high-dimensional partial differential equations. Their popularity stems from their empirical success on several challenging learning problems (computer chess/Go, autonomous navigation, face recognition). However, most scholars agree that a convincing theoretical explanation for this success is still lacking. Since these applications revolve around approximating an unknown function from data observations, part of the answer must involve the ability of NNs to produce accurate approximations.This article surveys the known approximation properties of the outputs of NNs with the aim of uncovering the properties that are not present in the more traditional methods of approximation used in numerical analysis, such as approximations using polynomials, wavelets, rational functions and splines. Comparisons are made with traditional approximation methods from the viewpoint of rate distortion, i.e. error versus the number of parameters used to create the approximant. Another major component in the analysis of numerical approximation is the computational time needed to construct the approximation, and this in turn is intimately connected with the stability of the approximation algorithm. So the stability of numerical approximation using NNs is a large part of the analysis put forward.The survey, for the most part, is concerned with NNs using the popular ReLU activation function. In this case the outputs of the NNs are piecewise linear functions on rather complicated partitions of the domain of f into cells that are convex polytopes. When the architecture of the NN is fixed and the parameters are allowed to vary, the set of output functions of the NN is a parametrized nonlinear manifold. It is shown that this manifold has certain space-filling properties leading to an increased ability to approximate (better rate distortion) but at the expense of numerical stability. The space filling creates the challenge to the numerical method of finding best or good parameter choices when trying to approximate.

Publisher

Cambridge University Press (CUP)

Subject

General Mathematics,Numerical Analysis

Reference95 articles.

1. Cohen, A. , DeVore, R. , Petrova, G. and Wojtaszczyk, P. (2020), Optimal stable nonlinear approximation. Available at arXiv:2009.09907.

2. Gühring, I. , Raslan, M. and Kutyniok, G. (2020), Expressivity of deep neural networks. Available at arXiv:2007.04759.

3. Dziugaite, G. K. and Roy, D. M. (2017), Computing nonvacuous generalization bounds for deep (stochastic) neural networks with many more parameters than training data, in Workshop on Principled Approaches to Deep Learning (ICML 2017).

Cited by 80 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Interplay between depth and width for interpolation in neural ODEs;Neural Networks;2024-12

2. Method of fundamental solutions: New approximation results and applications;Journal of Computational and Applied Mathematics;2024-10

3. Error Analysis Based on Inverse Modified Differential Equations for Discovery of Dynamics Using Linear Multistep Methods and Deep Learning;SIAM Journal on Numerical Analysis;2024-09-04

4. Labeled sample compression schemes for complexes of oriented matroids;Journal of Computer and System Sciences;2024-09

5. Towards stable and efficient nitrogen removal in wastewater treatment processes via an adaptive neural network based sliding mode controller;Water Research X;2024-09