Affiliation:
1. College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210000, China
2. Jiangsu Automation Research Institute, Lianyungang 222000, China
Abstract
With the development of social media, the internet, and sensing technologies, multimodal data are becoming increasingly common. Integrating these data into knowledge graphs can help models better understand and exploit such rich sources of information. Existing entity alignment methods for knowledge graphs generally extract features of different modalities, such as structure, text, attributes, and images, fuse these features, and then compute entity similarity across knowledge graphs from the fused representation. However, the structures, attribute information, image information, and textual descriptions of different knowledge graphs often differ significantly, so directly fusing information from different modalities can easily introduce noise and degrade alignment performance. To address these issues, this paper proposes a knowledge graph entity alignment method based on multimodal data supervision. First, a Transformer is used to obtain encoded representations of knowledge graph entities. Then, multimodal supervision is applied to learn the entity representations, so that the entity vectors carry rich multimodal semantic information and the learned representations generalize better. Finally, the information from the different modalities is mapped into a shared low-dimensional subspace in which similar entities lie closer together, further improving alignment quality. In experiments on the DBP15K dataset, the proposed method achieves the best results compared with methods such as MTransE, JAPE, EVA, and DNCN.
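The pipeline described in the abstract (Transformer encoding of entities, per-modality projection into a shared low-dimensional subspace, and similarity-based alignment) can be summarized in a minimal sketch. The following PyTorch code is illustrative only: the module name SharedSubspaceAligner, the feature dimensions, and the mean-pooling/sum-fusion choices are assumptions for exposition, not the paper's actual implementation.

```python
# Hypothetical sketch of multimodal entity alignment in a shared subspace.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SharedSubspaceAligner(nn.Module):
    def __init__(self, struct_dim=128, text_dim=300, image_dim=512, shared_dim=64):
        super().__init__()
        # Transformer encoder over an entity's feature/neighborhood sequence.
        layer = nn.TransformerEncoderLayer(d_model=struct_dim, nhead=4,
                                           batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        # One projection per modality into the shared low-dimensional subspace.
        self.proj_struct = nn.Linear(struct_dim, shared_dim)
        self.proj_text = nn.Linear(text_dim, shared_dim)
        self.proj_image = nn.Linear(image_dim, shared_dim)

    def forward(self, struct_seq, text_feat, image_feat):
        # struct_seq: (batch, seq_len, struct_dim); text/image feats: (batch, dim)
        h = self.encoder(struct_seq).mean(dim=1)   # pooled structural encoding
        z = (self.proj_struct(h)                   # fuse modalities in the
             + self.proj_text(text_feat)           # shared subspace (sum fusion
             + self.proj_image(image_feat))        # assumed for illustration)
        return F.normalize(z, dim=-1)              # unit vectors for cosine sim

def alignment_scores(emb_g1, emb_g2):
    """Cosine similarity between entities of two KGs; higher = more likely aligned."""
    return emb_g1 @ emb_g2.T

# Usage: encode entities of both graphs with the same model, then rank candidates.
model = SharedSubspaceAligner()
e1 = model(torch.randn(4, 8, 128), torch.randn(4, 300), torch.randn(4, 512))
e2 = model(torch.randn(6, 8, 128), torch.randn(6, 300), torch.randn(6, 512))
print(alignment_scores(e1, e2).shape)  # (4, 6) similarity matrix
```

Because all three modalities land in the same normalized subspace, a single dot product serves as the alignment score, which is what lets the multimodal supervision pull matching entity pairs closer together during training.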
Funder
Joint Fund of National Natural Science Foundation of China and Civil Aviation Administration of China