1. Language Modeling with Gated Convolutional Networks;dauphin;ArXiv,2017
2. Performance Analysis of Various Activation Functions in Generalized MLP Architectures of Neural Networks;karlik;International Journal of Artificial Intelligence and Expert Systems,2011
3. Rectified linear units improve restricted boltzmann machines;nair;Haifa,2010
4. Activation functions in deep learning: A comprehensive survey and benchmark
5. Lecture 6.5-rmsprop: “Divide the gradient by a running average of its recent magnitude;tieleman;COURSERA Neural Netw Mach Learn,2012