1. ALBERT: A Lite BERT for Self-supervised Learning of Language Representations;lan;Proc of ICLR 2020 Conference,0
2. XLNet: Generalized Autoregressive Pretraining for Language Understanding;yang;Neural Information Processing Systems,2019
3. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation;wu;ArXiv Preprint,2016