1. Ellipsoidal trust region methods and the marginal value of Hessian information for neural network training;Adolphs,2019
2. Finding local minima for nonconvex optimization in linear time;Agarwal,2016
3. Natasha 2: Faster non-convex optimization than SGD;Allen-Zhu;Advances in Neural Information Processing Systems,2018
4. Efficient approaches for escaping higher order saddle points in non-convex optimization;Anandkumar,2016
5. Second-order information in non-convex stochastic optimization: power and limitations;Arjevani,2020