1. On the difficulty of training recurrent neural networks;Pascanu,2013
2. D. Mishkin, J. Matas, All you need is a good init, arXiv preprint arXiv:1511.06422(2015).
3. Improving the learning speed of 2-layer neural networks by choosing initial values of the adaptive weights;Nguyen,1990
4. Understanding the difficulty of training deep feedforward neural networks;Glorot,2010
5. Delving deep into rectifiers: surpassing human-level performance on imagenet classification;He,2015