1. Breuel, T. M. (2015). On the Convergence of SGD Training of Neural Networks. ArXiv Preprint arXiv:1508.02790.
2. Daniel, C., Taylor, J., & Nowozin, S. (2016). Learning step size controllers for robust neural network training. In Proceedings of the thirtieth AAAI conference on artificial intelligence.
3. Adaptive subgradient methods for online learning and stochastic optimization;Duchi;Journal of Machine Learning Research (JMLR),2011
4. Assessing the importance of features for multi-layer perceptrons;Egmont-Petersen;Neural Networks,1998
5. Evaluating deep learning architectures for speech emotion recognition;Fayek;Neural Networks,2017