1. Neural Networks for Pattern Recognition;Bishop,1995
2. Neural Networks: Methodology and Applications;Dreyfus,2005
3. N.S. Keskar, D. Mudigere, J. Nocedal, M. Smelyanskiy, P.T.P. Tang, On large-batch training for deep learning: Generalization gap and sharp minima, in: International Conference on Learning Representations, 2017, pp. 1–16.
4. Open problem: The landscape of the loss surfaces of multilayer networks;Choromanska,2015
5. Visualising basins of attraction for the cross-entropy and the squared error neural network loss functions;Bosman;Neurocomputing,2020