1. A convergence theory for deep learning via over-parameterization;Allen-Zhu,2019
2. Analytic study of families of spurious minima in two-layer relu neural networks: a tale of symmetry ii;Arjevani;Advances in Neural Information Processing Systems,2021
3. Fine-grained analysis of optimization and generalization for overparameterized two-layer neural networks;Arora,2019
4. An elementary introduction to modern convex geometry;Ball;Flavors of Geometry,1997
5. Two models of double descent for weak features;Belkin;SIAM Journal on Mathematics of Data Science,2020