Effect of Attention and Self-Supervised Speech Embeddings on Non-Semantic Speech Tasks-Reference-Cited by-同舟云学术

Effect of Attention and Self-Supervised Speech Embeddings on Non-Semantic Speech Tasks

Published:2023-10-26 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the 31st ACM International Conference on Multimedia
language:
Short-container-title:

Author:

Mohapatra Payal¹^ORCID,Pandey Akash¹^ORCID,Sui Yueyuan¹^ORCID,Zhu Qi¹^ORCID

Affiliation:

1. Northwestern University, Evanston, IL, USA

Funder

National Science Foundation

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3581783.3612855

Reference22 articles.

1. Speech Emotion Recognition with deep learning

2. Alexei Baevski , Yuhao Zhou , Abdelrahman Mohamed , and Michael Auli . 2020. wav2vec 2.0: A framework for self-supervised learning of speech representations. Advances in neural information processing systems , Vol. 33 ( 2020 ), 12449--12460. Alexei Baevski, Yuhao Zhou, Abdelrahman Mohamed, and Michael Auli. 2020. wav2vec 2.0: A framework for self-supervised learning of speech representations. Advances in neural information processing systems, Vol. 33 (2020), 12449--12460.

3. Carlos Busso , Murtaza Bulut , Chi-Chun Lee , Abe Kazemzadeh , Emily Mower , Samuel Kim , Jeannette N Chang , Sungbok Lee , and Shrikanth S Narayanan . 2008 . IEMOCAP: Interactive emotional dyadic motion capture database. Language resources and evaluation , Vol. 42 (2008), 335--359. Carlos Busso, Murtaza Bulut, Chi-Chun Lee, Abe Kazemzadeh, Emily Mower, Samuel Kim, Jeannette N Chang, Sungbok Lee, and Shrikanth S Narayanan. 2008. IEMOCAP: Interactive emotional dyadic motion capture database. Language resources and evaluation, Vol. 42 (2008), 335--359.

4. Speech emotion recognition with multi-task learning;Cai Xingyu;Interspeech,2021

5. Mapping 24 emotions conveyed by brief human vocalization.