1. A convergence theory for deep learning via over-parameterization;Allen-Zhu,2019
2. Lagrangian support vector regression via unconstrained convex minimization;Balasundaram;Neural Networks,2013
3. A representer theorem for deep kernel learning;Bohn;Journal of Machine Learning Research,2019
4. The sem algorithm: a probabilistic teacher algorithm derived from the em algorithm for the mixture problem;Celeux;Computational Statistics Quarterly,1985
5. Underdamped langevin mcmc: a non-asymptotic analysis;Cheng,2018