1. Emotion Recognition from Speech Using wav2vec 2.0 Embeddings
2. Temporal Context in Speech Emotion Recognition
3. BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding (Devlin et al., 2019)
4. Wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations (Baevski et al., 2020)
5. Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training