1. Hinton GE, Osindero S, Teh Y-W (2006) A fast learning algorithm for deep belief nets. Neural Comput 18(7):1527–1554
2. Nair V, Hinton GE (2010) Rectified linear units improve restricted boltzmann machines. In: Icml
3. Hinton GE, Srivastava N, Krizhevsky A, Sutskever I, Salakhutdinov RR, Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580
4. Goodfellow I, Warde-Farley D, Mirza M, Courville A, Bengio Y (2013) Maxout networks. In: International conference on machine learning, PMLR, pp 1319–1327
5. Agostinelli F, Hoffman M, Sadowski P, Baldi P, Learning activation functions to improve deep neural networks. arXiv preprint arXiv:1412.6830