1. Efficient Esti-mation of Word Representations in Vector Space;mikolov;1st International Conference on Learning Representations ICLR 2013,2013
2. Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation
3. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding;devlin;Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics Human Language Technologies NAACL-HLT 2019,0
4. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation;wu;CoRR,2016
5. Temporal pattern attention for multivariate time series forecasting