How Tight Are the Vapnik-Chervonenkis Bounds?-Reference-Cited by-同舟云学术

How Tight Are the Vapnik-Chervonenkis Bounds?

Published:1992-03 Issue:2 Volume:4 Page:249-269
ISSN:0899-7667
Container-title:Neural Computation
language:en
Short-container-title:Neural Computation

Author:

Cohn David¹,Tesauro Gerald²

Affiliation:

1. Department of Computer Science and Engineering, University of Washington, Seattle, WA 98195 USA

2. IBM Watson Research Center, P.O. Box 704, Yorktown Heights, NY 10598 USA

Abstract

We describe a series of numerical experiments that measure the average generalization capability of neural networks trained on a variety of simple functions. These experiments are designed to test the relationship between average generalization performance and the worst-case bounds obtained from formal learning theory using the Vapnik-Chervonenkis (VC) dimension (Blumer et al. 1989; Haussler et al. 1990). Recent statistical learning theories (Tishby et al. 1989; Schwartz et al. 1990) suggest that surpassing these bounds might be possible if the spectrum of possible generalizations has a “gap” near perfect performance. We indeed find that, in some cases, the average generalization is significantly better than the VC bound: the approach to perfect performance is exponential in the number of examples m, rather than the 1/m result of the bound. However, in these cases, we have not found evidence of the gap predicted by the above statistical theories. In other cases, we do find the 1/m behavior of the VC bound, and in these cases, the numerical prefactor is closely related to the prefactor contained in the bound.

Publisher

MIT Press - Journals

Subject

Cognitive Neuroscience,Arts and Humanities (miscellaneous)

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/neco.1992.4.2.249

Reference8 articles.

1. What Size Net Gives Valid Generalization?

2. Learnability and the Vapnik-Chervonenkis dimension

3. The multilayer perceptron as an approximation to a Bayes optimal discriminant function

4. Exhaustive Learning

Cited by 44 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Resolution of similar patterns in a solvable model of unsupervised deep learning with structured data;Chaos, Solitons & Fractals;2024-05

2. Critical properties of the SAT/UNSAT transitions in the classification problem of structured data;Journal of Statistical Mechanics: Theory and Experiment;2021-11-01

3. A theory of universal learning;Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing;2021-06-15

4. Statistical learning theory of structured data;Physical Review E;2020-09-14

5. Beyond the Storage Capacity: Data-Driven Satisfiability Transition;Physical Review Letters;2020-09-14