1. Deep learning;LeCun;Nature,2015
2. Attention is all you need;Vaswani;Adv. Neural Inf. Process. Syst.,2017
3. Transformer models for text-based emotion detection: a review of BERT-based approaches;Acheampong;Artif. Intell. Rev.,2021
4. Efficient transformers: A survey;Tay;ACM Comput. Surv.,2020
5. Long-short transformer: Efficient transformers for language and vision;Zhu;Adv. Neural Inf. Process. Syst.,2021