1. Alan Turing, Computing machinery and intelligence, Mind, 1950.
2. Thomas M Mitchell, Machine Learning, McGraw-Hill, Inc. New York, NY, USA, 1997.
3. D E Rumelhart, G E Hinton and R J Williams, Learning representations by back-propagating errors, Nature, Vol.323, pp.533–536, 1986.
4. Sebastian Ruder, An overview of gradient descent optimization algorithms, CoRR, 1609.04747, 2016.
5. A Krizhevsky, I Sutskever and G Hinton, ImageNet classification with deep convolutional neural networks, Proc. Advances in Neural Information Processing Systems (NIPS), 2012.