1. Bahdanau, D., Cho, K., & Bengio, Y. (2014). Neural machine translation by jointly learning to align and translate. Technical report, arXiv preprint arXiv:1409.0473.
2. Learning deep architectures for AI;Bengio,2009
3. Bengio, Y., Lamblin, P., Popovici, D., & Larochelle, H. (2007). Greedy layer-wise training of deep networks. In NIPS.
4. Multi-column deep neural network for traffic sign classification;Ciresan;Neural Networks,2012
5. Dahl, G.E., Ranzato, M., Mohamed, A., & Hinton, G.E. (2010). Phone recognition with the mean-covariance restricted Boltzmann machine. In NIPS.