1. Zhang, H., Cisse, M., Dauphin, Y. N. and Lopez-Paz, D. (2018). mixup: Beyond empirical risk minimization. In International Conference on Learning Representations.
2. Vaswani, A., Shazeer, N. M., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, L. and Polosukhin, I. (2017). Attention is all you need. arXiv:1706.03762.
3. OpenAI (2022). Embeddings. OpenAI Documentation. Available at https://platform.openai.com/docs/guides/embeddings/what-are-embeddings
4. Deep Learning-based Text Classification
5. A Survey on Transfer Learning