1. Multimodal sentiment analysis with word-level fusion and reinforcement learning;Chen,2017
2. Fu, Z., Fu, Z., Liu, Q., Cai, W., Wang, Y., 2022. Sparsett: Visual tracking with sparse transformers. arXiv preprint arXiv:2205.03776.
3. A novel feature fusion network for multimodal emotion recognition from EEG and eye movement signals;Fu;Front. Neurosci.,2023
4. Ghosh, S., Tyagi, U., Ramaneswaran, S., Srivastava, H., Manocha, D., 2022. Mmer: Multimodal multi-task learning for speech emotion recognition. arXiv preprint arXiv:2203.16794.
5. Multi-modal emotion recognition with self-guided modality calibration;Hou,2022