1. Hierarchical Face Recognition Based on SVDD and SVM
2. Control batch size and learning rate to generalize well: Theoretical and empirical evidence;he;International Conference on Neural Information Processing Systems,2019
3. How to escape saddle points efficiently;jin;International Conference on Machine Learning,2017
4. A diffusion theory for deep learning dynamics: Stochastic gradient descent exponentially favors flat minima;xie;International Conference on Learning Representations,2020