1. Multimodal video sentiment analysis using deep learning approaches, a survey;S A Abdu;Information Fusion,2021
2. Bottom-up and top-down attention for image captioning and visual question answering;P Anderson;Proceedings of CVPR,2018
3. Generalisation in named entity recognition: A quantitative analysis;I Augenstein;Computer Speech & Language,2017
4. Can images help recognize entities? A study of the role of images for multimodal NER;S Chen;Proceedings of W-NUT,2021
5. Good visual guidance makes a better extractor: Hierarchical visual prefix for multimodal entity and relation extraction;X Chen;Findings of NAACL,2022