Improving Scene Graph Classification by Exploiting Knowledge from Texts-Reference-Cited by-同舟云学术

Improving Scene Graph Classification by Exploiting Knowledge from Texts

Published:2022-06-28 Issue:2 Volume:36 Page:2189-2197
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Sharifzadeh Sahand,Baharlou Sina Moayed,Schmitt Martin,Schütze Hinrich,Tresp Volker

Abstract

Training scene graph classification models requires a large amount of annotated image data. Meanwhile, scene graphs represent relational knowledge that can be modeled with symbolic data from texts or knowledge graphs. While image annotation demands extensive labor, collecting textual descriptions of natural scenes requires less effort. In this work, we investigate whether textual scene descriptions can substitute for annotated image data. To this end, we employ a scene graph classification framework that is trained not only from annotated images but also from symbolic data. In our architecture, the symbolic entities are first mapped to their correspondent image-grounded representations and then fed into the relational reasoning pipeline. Even though a structured form of knowledge, such as the form in knowledge graphs, is not always available, we can generate it from unstructured texts using a transformer-based language model. We show that by fine-tuning the classification pipeline with the extracted knowledge from texts, we can achieve ~8x more accurate results in scene graph classification, ~3x in object classification, and ~1.5x in predicate classification, compared to the supervised baselines with only 1% of the annotated images.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Review on scene graph generation methods;Multiagent and Grid Systems;2024-08-12

2. Improving rare relation inferring for scene graph generation using bipartite graph network;Computer Vision and Image Understanding;2024-02

3. Prior Knowledge-driven Dynamic Scene Graph Generation with Causal Inference;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26

4. Iterative Learning with Extra and Inner Knowledge for Long-tail Dynamic Scene Graph Generation;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26

5. Scene Graph Generation using Depth-based Multimodal Network;2023 IEEE International Conference on Multimedia and Expo (ICME);2023-07