1. Attention is all you need;Vaswani;Adv. Neural Inf. Process. Syst.,2017
2. Devlin, J., Chang, M.W., Lee, K., and Toutanova, K. (2018). Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv.
3. Radford, A., Narasimhan, K., Salimans, T., and Sutskever, I. (2018). Improving language understanding by generative pre-training. OpenAI.
4. Natural language processing;Chowdhary;Fundam. Artif. Intell.,2020
5. A survey of the usages of deep learning for natural language processing;Otter;IEEE Trans. Neural Netw. Learn. Syst.,2020