1. Efficient online and batch learning using forward backward splitting;Duchi;J. Mach. Learn. Res.,2009
2. Deep Learning;Goodfellow,2016
3. On the convergence of stochastic gradient descent with adaptive stepsizes;Li,2019
4. Learning overparameterized neural networks via stochastic gradient descent on structured data;Li,2018
5. Optimal rates for multi-pass stochastic gradient methods;Lin;J. Mach. Learn. Res.,2017