Visual Relationship Detection With Deep Structural Ranking

Published:2018-04-27 Issue:1 Volume:32
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title:AAAI

Author:

Liang Kongming,Guo Yuhong,Chang Hong,Chen Xilin

Abstract

Visual relationship detection aims to describe the interactions between pairs of objects. Unlike individual object learning tasks, the number of possible relationships is much larger, which makes the space hard to explore based only on the visual appearance of objects. In addition, because human annotation effort is limited, the annotations for visual relationships are usually incomplete, which increases the difficulty of model training and evaluation. In this paper, we propose a novel framework, called Deep Structural Ranking, for visual relationship detection. To complement the representation ability of visual appearance, we integrate multiple cues for predicting the relationships contained in an input image. Moreover, we design a new ranking objective function by enforcing the annotated relationships to have higher relevance scores. Unlike previous works, our proposed method can both facilitate the co-occurrence of relationships and mitigate the incompleteness problem. Experimental results show that our proposed method outperforms the state of the art on two widely used datasets. We also demonstrate its superiority in detecting zero-shot relationships.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 28 articles.

1. Online video visual relation detection with hierarchical multi-modal fusion;Multimedia Tools and Applications;2024-01-18

2. Knowledge Enhanced Zero-Shot Visual Relationship Detection;Lecture Notes in Computer Science;2024

3. Focus the Overlapping Problem on Few-Shot Object Detection via Multiple Predictions;Pattern Recognition and Computer Vision;2023-12-24

4. Plugging Stylized Controls in Open-Stylized Image Captioning;Pattern Recognition and Computer Vision;2023-12-24

5. Prioritized Planning for Target-Oriented Manipulation via Hierarchical Stacking Relationship Prediction;2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS);2023-10-01