1. Optimization methods for large-scale machine learning;SIAM Rev.,2018
2. Bottou, L. (2010, January 22–27). Large-scale machine learning with stochastic gradient descent. Proceedings of the International Conference on Computational Statistics, Paris, France.
3. Hardt, M., Recht, B., and Singer, Y. (2016, January 19–24). Train faster, generalize better: Stability of stochastic gradient descent. Proceedings of the International Conference on Machine Learning, New York, NY, USA.
4. Acceleration of stochastic approximation by averaging;SIAM J. Control. Optim.,1992
5. Zinkevich, M. (2003, January 21–24). Online convex programming and generalized infinitesimal gradient ascent. Proceedings of the 20th International Conference on Machine Learning (ICML-03), Washington, DC, USA.