1. Dzmitry Bahdanau, Philemon Brakel, Kelvin Xu, Anirudh Goyal, Ryan Lowe, Joelle Pineau, Aaron Courville, and Yoshua Bengio. 2017. An actor-critic algorithm for sequence prediction. In Proceedings of ICLR 2017.
2. Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural machine translation by jointly learning to align and translate. In Proceedings of ICLR 2015.
3. Mia Xu Chen, Orhan Firat, Ankur Bapna, Melvin Johnson, Wolfgang Macherey, George Foster, Llion Jones, Mike Schuster, Noam Shazeer, Niki Parmar, Ashish Vaswani, Jakob Uszkoreit, Lukasz Kaiser, Zhifeng Chen, Yonghui Wu, and Macduff Hughes. 2018. The best of both worlds: Combining recent advances in neural machine translation. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 76–86. Association for Computational Linguistics.
4. Yongchao Deng, Shanbo Cheng, Jun Lu, Kai Song, Jingang Wang, Shenglan Wu, Liang Yao, Guchun Zhang, Haibo Zhang, Pei Zhang, Changfeng Zhu, and Boxing Chen. 2018. Alibaba’s neural machine translation systems for WMT18. In Proceedings of the Third Conference on Machine Translation: Shared Task Papers, pages 368–376. Association for Computational Linguistics.
5. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. BERT: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.