1. Arthur, P., Neubig, G., & Nakamura, S. (2016). Incorporating discrete translation lexicons into neural machine translation. In Proceedings of EMNLP. arXiv:1606.02006v2.
2. Auli, M., & Gao, J. (2014). Decoder integration and expected BLEU training for recurrent neural network language models. In Proceedings of ACL (pp. 136–142).
3. Bahdanau, D., Cho, K., & Bengio, Y. (2015). Neural machine translation by jointly learning to align and translate. In Proceedings of ICLR.
4. Bengio, Y., Ducharme, R., Vincent, P., & Jauvin, C. (2003). A neural probabilistic language model. Journal of Machine Learning Research, 3, 1137–1155.
5. Brown, P. F., Della Pietra, S. A., Della Pietra, V. J., & Mercer, R. L. (1993). The mathematics of statistical machine translation: Parameter estimation. Computational Linguistics.