1. Massively Multilingual Neural Machine Translation
2. Unsupervised Cross-lingual Representation Learning at Scale
3. Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proc. of NAACL.
4. Angela Fan Shruti Bhosale Holger Schwenk Zhiyi Ma Ahmed El-Kishky Siddharth Goyal Mandeep Baines Onur Celebi Guillaume Wenzek Vishrav Chaudhary Naman Goyal Tom Birch Vitaliy Liptchinsky Sergey Edunov Michael Auli and Armand Joulin. 2021. Beyond English-Centric Multilingual Machine Translation. J. Mach. Learn. Res. (2021).
5. Multi-Way, Multilingual Neural Machine Translation with a Shared Attention Mechanism