1. NetVLAD: CNN Architecture for Weakly Supervised Place Recognition
2. Minghai Chen, Sen Wang, Paul Pu Liang, Tadas Baltrušaitis, Amir Zadeh, and Louis-Philippe Morency. 2017. Multimodal sentiment analysis with word-level fusion and reinforcement learning. In ICMI.
3. Huan Deng, Zhenguo Yang, Tianyong Hao, Qing Li, and Wenyin Liu. 2022. Multimodal Affective Computing with Dense Fusion Transformer for Inter- and Intra-modality Interactions. IEEE Transactions on Multimedia, Vol. Early Access (2022).
4. James J. Deng and Clement H. C. Leung. 2021. Towards Learning a Joint Representation from Transformer in Multimodal Emotion Recognition. In BI.
5. A Review and Meta-Analysis of Multimodal Affect Detection Systems