1. Bahdanau, D., Cho, K., Bengio, Y.: Neural Machine Translation by Jointly Learning to Align and Translate. arXiv:1409.0473 (2014)
2. Luong, M.T., Pham, H., Manning, C.D.: Effective Approaches to Attention-based Neural Machine Translation. arXiv:1508.04025v5 (2015)
3. Vaswani, A., et al.: Attention Is All You Need. In: 31st Conference on Neural Information Processing Systems, Long Beach, CA, USA (2017)
4. Edunov, S., Ott, M., Auli, M., Grangier, D.: Understanding Back-Translation at Scale. arXiv:1808.09381v2 (2018)
5. Ott, M., Edunov, S., Grangier, D., Auli, M.: Scaling Neural Machine Translation. arXiv:1806.00187v3 (2018)