1. Allen-Zhu Z, Li Y, Liang Y. Learning and generalization in overparameterized neural networks, going beyond two layers. ArXiv:1811.04918, 2018
2. Allen-Zhu Z, Li Y, Song Z. A convergence theory for deep learning via over-parameterization. ArXiv:1811.03962, 2018
3. Aronszajn N. Theory of reproducing kernels. Trans Amer Math Soc, 1950, 68: 337–404
4. Arora S, Du S S, Hu W, et al. Fine-grained analysis of optimization and generalization for overparameterized two-layer neural networks. ArXiv:1901.08584, 2019
5. Barron A R. Universal approximation bounds for superpositions of a sigmoidal function. IEEE Trans Inform Theory, 1993, 39: 930–945