1. Adiwardana, D., & Luong, M.-T. (2020). Towards a human-like opendomain chatbot. arXiv preprint arXiv:2001.09977.
2. Do deep nets really need to be deep?;J.Ba;Advances in Neural Information Processing Systems,2014
3. Beltagy, Iz., & Matthew, E. (2020). Longformer: The long-document transformer. arXiv preprint arXiv:2004.05150.
4. Learning long-term dependencies with gradient descent is difficult
5. BrandR.GholamiS.HorowitzD.ZhouL.BhabeshS. (2022). Text classification for online conversations with machine learning on aws. AWS Machine Learning Blog.