Building Robust Multimodal Sentiment Recognition via a Simple yet Effective Multimodal Transformer-Reference-Cited by-同舟云学术

Building Robust Multimodal Sentiment Recognition via a Simple yet Effective Multimodal Transformer

Published:2023-10-26 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the 31st ACM International Conference on Multimedia
language:
Short-container-title:

Author:

Zong Daoming¹^ORCID,Ding Chaoyue¹^ORCID,Li Baoxiang¹^ORCID,Zhou Dinghao¹^ORCID,Li Jiakui¹^ORCID,Zheng Ken¹^ORCID,Zhou Qunyan¹^ORCID

Affiliation:

1. SenseTime Group Limited, Beijing, China

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3581783.3612872

Reference35 articles.

1. Alexei Baevski Yuhao Zhou Abdelrahman Mohamed and Michael Auli. 2020. wav2vec 2.0: A framework for self-supervised learning of speech representations. In NeurIPS. 12449--12460. Alexei Baevski Yuhao Zhou Abdelrahman Mohamed and Michael Auli. 2020. wav2vec 2.0: A framework for self-supervised learning of speech representations. In NeurIPS. 12449--12460.

2. Multimodal machine learning: A survey and taxonomy;Tadas Baltruvs;IEEE Transactions on Pattern Analysis and Machine Intelligence,2018

3. Chun-Fu Richard Chen , Quanfu Fan , and Rameswar Panda . 2021 . Crossvit: Cross-attention multi-scale vision transformer for image classification. In ICCV. 357--366. Chun-Fu Richard Chen, Quanfu Fan, and Rameswar Panda. 2021. Crossvit: Cross-attention multi-scale vision transformer for image classification. In ICCV. 357--366.

4. Junyan Cheng Iordanis Fostiropoulos Barry Boehm and Mohammad Soleymani. 2021. Multimodal phased transformer for sentiment analysis. In EMNLP. 2447--2458. Junyan Cheng Iordanis Fostiropoulos Barry Boehm and Mohammad Soleymani. 2021. Multimodal phased transformer for sentiment analysis. In EMNLP. 2447--2458.

5. Lukas Christ Shahin Amiriparian Alice Baird Alexander Kathan Niklas Müller Steffen Klug Chris Gagne Panagiotis Tzirakis Eva-Maria Meßner Andreas König etal 2023. The MuSe 2023 Multimodal Sentiment Analysis Challenge: Mimicked Emotions Cross-Cultural Humour and Personalisation. arXiv preprint arXiv:2305.03369 (2023). Lukas Christ Shahin Amiriparian Alice Baird Alexander Kathan Niklas Müller Steffen Klug Chris Gagne Panagiotis Tzirakis Eva-Maria Meßner Andreas König et al. 2023. The MuSe 2023 Multimodal Sentiment Analysis Challenge: Mimicked Emotions Cross-Cultural Humour and Personalisation. arXiv preprint arXiv:2305.03369 (2023).

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Review of Key Technologies for Emotion Analysis Using Multimodal Information;Cognitive Computation;2024-06-01

2. Improving Multi-Modal Emotion Recognition Using Entropy-Based Fusion and Pruning-Based Network Architecture Optimization;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14