When and Why Are Deep Networks Better Than Shallow Ones?-Reference-Cited by-同舟云学术

When and Why Are Deep Networks Better Than Shallow Ones?

Published:2017-02-13 Issue:1 Volume:31 Page:
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Mhaskar Hrushikesh,Liao Qianli,Poggio Tomaso

Abstract

While the universal approximation property holds both for hierarchical and shallow networks, deep networks can approximate the class of compositional functions as well as shallow networks but with exponentially lower number of training parameters and sample complexity. Compositional functions are obtained as a hierarchy of local constituent functions, where "local functions'' are functions with low dimensionality. This theorem proves an old conjecture by Bengio on the role of depth in networks, characterizing precisely the conditions under which it holds. It also suggests possible answers to the the puzzle of why high-dimensional deep networks trained on large training sets often do not seem to show overfit.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 28 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Interplay between depth and width for interpolation in neural ODEs;Neural Networks;2024-12

2. How Deep Neural Networks Learn Compositional Data: The Random Hierarchy Model;Physical Review X;2024-07-01

3. Analysis of the neural network application effectiveness in predicting collision avoidance maneuvers for two vessels;Vestnik Gosudarstvennogo universiteta morskogo i rechnogo flota imeni admirala S. O. Makarova;2024-05-22

4. Robust Backdoor Detection for Deep Learning via Topological Evolution Dynamics;2024 IEEE Symposium on Security and Privacy (SP);2024-05-19

5. Comparing the advantages and disadvantages of physics-based and neural network-based modelling for predicting cycling power;Journal of Biomechanics;2024-05