1. Bad global minima exist and SGD can reach them;Liu,2019
2. Poly-time universality and limitations of deep learning;Abbe,2020
3. Shape matters: understanding the implicit bias of the noise covariance;HaoChen,2020
4. Stochastic gradient and Langevin processes;Cheng,2020
5. Stochastic modified equations and adaptive stochastic gradient algorithms;Li,2017