Author:
Philipp Grohs, Felix Voigtlaender
Abstract
We study the computational complexity of (deterministic or randomized) algorithms based on point samples for approximating or integrating functions that can be well approximated by neural networks. Such algorithms (most prominently stochastic gradient descent and its variants) are used extensively in the field of deep learning. One of the most important problems in this field concerns the question of whether it is possible to realize theoretically provable neural network approximation rates by such algorithms. We answer this question in the negative by proving hardness results for the problems of approximation and integration on a novel class of neural network approximation spaces. In particular, our results confirm a conjectured and empirically observed theory-to-practice gap in deep learning. We complement our hardness results by showing that error bounds of a comparable order of convergence are (at least theoretically) achievable.
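For readers unfamiliar with the setting, the following is a minimal illustrative sketch (not taken from the paper) of what an "algorithm based on point samples" looks like in practice: it may query an unknown target function at finitely many points and must then output an approximant (here, a small ReLU network trained with plain SGD) or an integral estimate (here, a simple Monte Carlo average). The target function `f`, the network width, the learning rate, and the sample size are all hypothetical choices made only for illustration.

```python
# Illustrative sketch only: a point-sample algorithm in the sense of the abstract.
import numpy as np

rng = np.random.default_rng(0)

def f(x):
    # Hypothetical target; in the paper the target is only assumed to lie
    # in a neural network approximation space.
    return np.sin(3 * np.pi * x)

# Draw m point samples (x_i, f(x_i)) on [0, 1]; these are the only
# evaluations of f the algorithm is allowed to use.
m = 200
x = rng.uniform(0.0, 1.0, size=m)
y = f(x)

# Tiny one-hidden-layer ReLU network, trained by SGD on the samples.
width = 32
W = rng.normal(scale=1.0, size=width)   # input weights
b = rng.normal(scale=1.0, size=width)   # biases
c = np.zeros(width)                     # output weights

def net(t):
    # Evaluate the network on an array of inputs t.
    return np.maximum(W * t[:, None] + b, 0.0) @ c

lr, epochs = 0.02, 500
for _ in range(epochs):
    for i in rng.permutation(m):
        h = np.maximum(W * x[i] + b, 0.0)   # hidden activations
        err = h @ c - y[i]                  # residual at the sample
        grad_c = err * h                    # gradient w.r.t. output weights
        grad_pre = err * c * (h > 0)        # backprop through the ReLU
        c -= lr * grad_c
        W -= lr * grad_pre * x[i]
        b -= lr * grad_pre

# Uniform-norm error on a fine grid, and a Monte Carlo integral estimate
# built from the same point samples.
grid = np.linspace(0.0, 1.0, 1000)
print("sup-norm error :", np.max(np.abs(net(grid) - f(grid))))
print("integral (MC)  :", y.mean(), " exact:", 2.0 / (3.0 * np.pi))
```

The hardness results summarized above concern the best error such sample-based schemes can guarantee uniformly over a neural network approximation space, not the behavior of this particular toy example.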
Publisher
Springer Science and Business Media LLC
Subject
Applied Mathematics, Computational Theory and Mathematics, Computational Mathematics, Analysis
Cited by
6 articles.