1. SGD‐QN: careful Quasi‐Newton stochastic gradient descent;Bordes A;J Mach Learn Res,2009
2. Trust region Newton method for large‐scale logistic regression;Lin C‐J;J Mach Learn Res,2008
3. Advances in optimizing recurrent networks
4. Clarkson Kenneth Lee. Algorithms for Closest‐Point Problems (Computational Geometry). Diss. Stanford University 1985.
5. Abadi M Agarwal A Barham P et al. Tensorflow: large‐scale machine learning on heterogeneous distributed systems; 2016. arXiv preprint arXiv:1603.04467.