1. Gradient-based learning applied to document recognition
2. Bidirectional recurrent neural networks
3. A. Krizhevsky, I. Sutskever, and G. E. Hinton, in Advances in Neural Information Processing Systems (Neural Information Processing Systems Foundation, 2012), p. 1097.
4. K. He, X. Zhang, S. Ren, and J. Sun, in Proceedings of IEEE CVPR (IEEE, 2016), p. 770.
5. A. Veit, M. J. Wilber, and S. Belongie, in Advances in Neural Information Processing Systems (Neural Information Processing Systems Foundation, 2016), p. 550.