EMO-AVSR: Two-Level Approach for Audio-Visual Emotional Speech Recognition-Reference-Cited by-同舟云学术

登录注册会员服务联系我们

EMO-AVSR: Two-Level Approach for Audio-Visual Emotional Speech Recognition

Published:2023 Issue: Volume: Page:18-31
ISSN:0302-9743
Container-title:Speech and Computer
language:
Short-container-title:

Author:

Ivanko Denis^ORCID,Ryumina Elena^ORCID,Ryumin Dmitry^ORCID,Axyonov Alexandr^ORCID,Kashevnik Alexey^ORCID,Karpov Alexey^ORCID

Publisher

Springer Nature Switzerland

Link

https://link.springer.com/content/pdf/10.1007/978-3-031-48309-7_2

Reference50 articles.

1. Boháček, M., Hrúz, M.: Sign pose-based transformer for word-level sign language recognition. In: Winter Conference on Applications of Computer Vision (WACV), pp. 182–191 (2022). https://doi.org/10.1109/WACVW54805.2022.00024

2. Cao, H., Cooper, D.G., Keutmann, M.K., Gur, R.C., Nenkova, A., Verma, R.: Crema-D: crowd-sourced emotional multimodal actors dataset. IEEE Trans. Affect. Comput. 5(4), 377–390 (2014). https://doi.org/10.1109/TAFFC.2014.2336244

3. Chen, C., Hu, Y., Zhang, Q., Zou, H., Zhu, B., Chng, E.S.: Leveraging modality-specific representations for audio-visual speech recognition via reinforcement learning. In: AAAI Conference on Artificial Intelligence, vol. 37, pp. 12607–12615 (2023). https://doi.org/10.48550/arXiv.2212.05301

4. Lecture Notes in Computer Science;JS Chung,2017

5. Deng, D., Chen, Z., Zhou, Y., Shi, B.: Mimamo net: integrating micro-and macro-motion for video emotion recognition. In: AAAI Conference on Artificial Intelligence, vol. 34, pp. 2621–2628 (2020). https://doi.org/10.1609/AAAI.V34I03.5646

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线，采集、加工和组织学术论文而形成的新型学术文献查询和分析系统，可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容，当前同舟云学术共收录了国内外主流学术期刊6万余种，收集的期刊论文及会议论文总量共计约1.5亿篇，并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询！咨询电话：010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号京ICP备18003416号-3