1. J. Bouvrie, Notes on convolutional neural networks, (n.d.).
2. ImageNet classification with deep convolutional neural networks;Krizhevsky;Commun. ACM.,2017
3. S. Ruder, An overview of gradient descent optimization algorithms, (2017). https://doi.org/10.48550/arXiv.1609.04747.
4. Large-scale machine learning with stochastic gradient descent;Bottou,2010
5. Gradient methods for minimizing composite functions;Nesterov;Math. Program.,2013