1. Carlos Busso , Murtaza Bulut , Chi-Chun Lee , Abe Kazemzadeh , Emily Mower , Samuel Kim , Jeannette N. Chang , Sungbok Lee , and Shrikanth S. Narayanan . 2008. IEMOCAP: interactive emotional dyadic motion capture database. Language resources and evaluation 42, 4 ( 2008 ), 335–359. https://doi.org/10.1007/s10579-008-9076-6 10.1007/s10579-008-9076-6 Carlos Busso, Murtaza Bulut, Chi-Chun Lee, Abe Kazemzadeh, Emily Mower, Samuel Kim, Jeannette N. Chang, Sungbok Lee, and Shrikanth S. Narayanan. 2008. IEMOCAP: interactive emotional dyadic motion capture database. Language resources and evaluation 42, 4 (2008), 335–359. https://doi.org/10.1007/s10579-008-9076-6
2. Hierarchical Network Based on the Fusion of Static and Dynamic Features for Speech Emotion Recognition
3. Sequential Graph Convolutional Network for Active Learning
4. Investigating Transformer Encoders and Fusion Strategies for Speech Emotion Recognition in Emergency Call Center Conversations.
5. Shubham Dokania and Vasudev Singh . 2019. Graph Representation learning for Audio & Music genre Classification. CoRR abs/1910.11117 ( 2019 ). arXiv:1910.11117 Shubham Dokania and Vasudev Singh. 2019. Graph Representation learning for Audio & Music genre Classification. CoRR abs/1910.11117 (2019). arXiv:1910.11117