1. Zhu T. Multimodal sentiment analysis with image-text interaction network[J]. IEEE Transactions on Multimedia, 2022.
2. Adapting BERT for Target-Oriented Multimodal Sentiment Classification
3. Kim W, Son B, Kim I. ViLT: Vision-and-language transformer without convolution or region supervision[C]//International Conference on Machine Learning. PMLR, 2021: 5583-5594.
4. Khan Z, Fu Y. Exploiting BERT for multimodal target sentiment classification through input space translation[C]//Proceedings of the 29th ACM International Conference on Multimedia. 2021: 3034-3042.
5. Collaborative fine-grained interaction learning for image–text sentiment analysis