EMO-AVSR: Two-Level Approach for Audio-Visual Emotional Speech Recognition
Author:
Publisher: Springer Nature Switzerland
Link: https://link.springer.com/content/pdf/10.1007/978-3-031-48309-7_2
Reference: 50 articles.
1. Boháček, M., Hrúz, M.: Sign pose-based transformer for word-level sign language recognition. In: Winter Conference on Applications of Computer Vision (WACV), pp. 182–191 (2022). https://doi.org/10.1109/WACVW54805.2022.00024
2. Cao, H., Cooper, D.G., Keutmann, M.K., Gur, R.C., Nenkova, A., Verma, R.: CREMA-D: crowd-sourced emotional multimodal actors dataset. IEEE Trans. Affect. Comput. 5(4), 377–390 (2014). https://doi.org/10.1109/TAFFC.2014.2336244
3. Chen, C., Hu, Y., Zhang, Q., Zou, H., Zhu, B., Chng, E.S.: Leveraging modality-specific representations for audio-visual speech recognition via reinforcement learning. In: AAAI Conference on Artificial Intelligence, vol. 37, pp. 12607–12615 (2023). https://doi.org/10.48550/arXiv.2212.05301
4. Chung, J.S.: Lecture Notes in Computer Science (2017)
5. Deng, D., Chen, Z., Zhou, Y., Shi, B.: Mimamo net: integrating micro- and macro-motion for video emotion recognition. In: AAAI Conference on Artificial Intelligence, vol. 34, pp. 2621–2628 (2020). https://doi.org/10.1609/AAAI.V34I03.5646