RpBERT: A Text-image Relation Propagation-based BERT Model for Multimodal NER

Published: 2021-05-18 Issue: 15 Volume: 35 Page: 13860-13868
ISSN: 2374-3468
Container-title: Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title: AAAI

Author:

Sun Lin, Wang Jiquan, Zhang Kai, Su Yindu, Weng Fangsheng

Abstract

Recently, multimodal named entity recognition (MNER) has utilized images to improve the accuracy of NER in tweets. However, most multimodal methods use attention mechanisms to extract visual clues regardless of whether the text and image are relevant. In practice, irrelevant text-image pairs account for a large proportion of tweets. Visual clues that are unrelated to the text can exert uncertain or even negative effects on multimodal model learning. In this paper, we introduce a method of text-image relation propagation into the multimodal BERT model. We integrate soft or hard gates to select visual clues and propose a multitask algorithm to train and validate the effects of relation propagation on the MNER datasets. In the experiments, we analyze in depth the changes in visual attention before and after the use of relation propagation. Our model achieves state-of-the-art performance on the MNER datasets.
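The gating idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `gate_visual_features`, the sigmoid scoring, and the 0.5 threshold are all assumptions made for illustration. It only shows how a text-image relevance score might scale visual features (soft gate) or switch them on or off (hard gate) before they reach a multimodal encoder.

```python
import math
from typing import List


def sigmoid(x: float) -> float:
    """Map a raw relevance logit to a score in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def gate_visual_features(
    visual: List[List[float]],
    relation_logit: float,
    mode: str = "soft",
    threshold: float = 0.5,  # assumed cut-off for the hard gate
) -> List[List[float]]:
    """Scale image-region features by a text-image relevance score.

    visual: one feature vector per image region (hypothetical layout).
    relation_logit: raw output of some text-image relation classifier.
    """
    score = sigmoid(relation_logit)
    if mode == "soft":
        gate = score  # keep features in proportion to relevance
    elif mode == "hard":
        gate = 1.0 if score > threshold else 0.0  # keep or drop entirely
    else:
        raise ValueError(f"unknown gate mode: {mode!r}")
    return [[gate * value for value in region] for region in visual]


if __name__ == "__main__":
    regions = [[1.0, 2.0], [3.0, 4.0]]
    print(gate_visual_features(regions, relation_logit=0.0, mode="soft"))
    print(gate_visual_features(regions, relation_logit=-2.0, mode="hard"))
```

With a soft gate, an irrelevant image still contributes a little. With a hard gate it is removed entirely, which is simpler but not differentiable unless a relaxation is used during training.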

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 53 articles.

1. The more quality information the better: Hierarchical generation of multi-evidence alignment and fusion model for multimodal entity and relation extraction;Information Processing & Management;2025-01

2. ICKA: An instruction construction and Knowledge Alignment framework for Multimodal Named Entity Recognition;Expert Systems with Applications;2024-12

3. Joint multimodal entity-relation extraction based on temporal enhancement and similarity-gated attention;Knowledge-Based Systems;2024-11

4. Joint Modal Circular Complementary Attention for Multimodal Aspect-Based Sentiment Analysis;2024 IEEE International Conference on Multimedia and Expo Workshops (ICMEW);2024-07-15

5. SAMNER: Image Screening and Cross-Modal Alignment Networks for Multimodal Named Entity Recognition;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30