Universal Function Approximation by Deep Neural Nets with Bounded Width and ReLU Activations-Reference-Cited by-同舟云学术

Universal Function Approximation by Deep Neural Nets with Bounded Width and ReLU Activations

Published:2019-10-18 Issue:10 Volume:7 Page:992
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Hanin Boris

Abstract

This article concerns the expressive power of depth in neural nets with ReLU activations and a bounded width. We are particularly interested in the following questions: What is the minimal width w min ( d ) so that ReLU nets of width w min ( d ) (and arbitrary depth) can approximate any continuous function on the unit cube [ 0 , 1 ] d arbitrarily well? For ReLU nets near this minimal width, what can one say about the depth necessary to approximate a given function? We obtain an essentially complete answer to these questions for convex functions. Our approach is based on the observation that, due to the convexity of the ReLU activation, ReLU nets are particularly well suited to represent convex functions. In particular, we prove that ReLU nets with width d + 1 can approximate any continuous convex function of d variables arbitrarily well. These results then give quantitative depth estimates for the rate of approximation of any continuous scalar function on the d-dimensional cube [ 0 , 1 ] d by ReLU nets with width d + 3 .

Funder

National Science Foundation

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/7/10/992/pdf

Reference18 articles.

1. Deep learning;Bengio;Nature,2015

2. Learning functions: When is deep better than shallow;Liao;arXiv,2016

3. Why Does Deep and Cheap Learning Work So Well?

4. Deep vs. shallow networks: An approximation theory perspective

Cited by 152 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Prediction of discretization of online GMsFEM using deep learning for Richards equation;Journal of Computational and Applied Mathematics;2025-01

2. A multiparametric approach to accelerating ReLU neural network based model predictive control;Control Engineering Practice;2024-10

3. Designing Universally-Approximating Deep Neural Networks: A First-Order Optimization Approach;IEEE Transactions on Pattern Analysis and Machine Intelligence;2024-09

4. Probabilistic Contraction Analysis of Iterated Random Operators;IEEE Transactions on Automatic Control;2024-09

5. Predicting and explaining nonlinear material response using deep physically guided neural networks with internal variables;Mathematics and Mechanics of Solids;2024-07-26