Image-embodied Knowledge Representation Learning-Reference-Cited by-同舟云学术

Image-embodied Knowledge Representation Learning

Published:2017-08 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence
language:
Short-container-title:

Author:

Xie Ruobing¹,Liu Zhiyuan²³,Luan Huanbo⁴,Sun Maosong⁵⁶

Affiliation:

1. Department of Computer Science and Technology, State Key Lab on Intelligent Technology and Systems, National Lab for Information Science and Technology, Tsinghua University, Beijing, China

2. Department of Computer Science and Technology, Tsinghua University, Beijing, China

3. Jiangsu Collaborative Innovation Center for Language Ability, Jiangsu Normal University, Xuzhou, China

4. Department of Computer Science and Technology, State Key Lab on Intelligent Technology and Systems, National Lab for Information Science and Technology, Tsinghua University, China

5. State Key Laboratory of Intelligent Technology and Systems, Tsinghua National Laboratory for Information Science and Technology, Department of Computer Science and Technology, Tsinghua University, Beijing, China

6. Jiangsu Collaborative Innovation Center for Language Ability, Jiangsu Normal University, Xuzhou 221009 China

Abstract

Entity images could provide significant visual information for knowledge representation learning. Most conventional methods learn knowledge representations merely from structured triples, ignoring rich visual information extracted from entity images. In this paper, we propose a novel Image-embodied Knowledge Representation Learning model (IKRL), where knowledge representations are learned with both triple facts and images. More specifically, we first construct representations for all images of an entity with a neural image encoder. These image representations are then integrated into an aggregated image-based representation via an attention-based method. We evaluate our IKRL models on knowledge graph completion and triple classification. Experimental results demonstrate that our models outperform all baselines on both tasks, which indicates the significance of visual information for knowledge representations and the capability of our models in learning knowledge representations with images.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 91 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A survey: knowledge graph entity alignment research based on graph embedding;Artificial Intelligence Review;2024-08-03

2. MM-Transformer: A Transformer-Based Knowledge Graph Link Prediction Model That Fuses Multimodal Features;Symmetry;2024-07-29

3. Multi-hop neighbor fusion enhanced hierarchical transformer for multi-modal knowledge graph completion;World Wide Web;2024-07-19

4. Contrast then Memorize: Semantic Neighbor Retrieval-Enhanced Inductive Multimodal Knowledge Graph Completion;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10

5. NativE: Multi-modal Knowledge Graph Completion in the Wild;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10