Affiliation:
1. Institute of Information Science, Beijing Jiaotong University, Beijing, China
2. Beijing Key Laboratory of Advanced Information Science and Network Technology, Beijing, China
Abstract
Person Re‐identification (Re‐ID) is the task of matching target pedestrians across cross‐camera surveillance views. Learning discriminative feature representations is the central challenge in person Re‐ID. A few recent methods introduce text descriptions as auxiliary information to enhance feature representations, as text offers richer semantic information and perspective consistency. However, these works usually process text and images separately, which leads to an absence of cross‐modal interaction. In this article, a Dual‐modal Graph Attention Interaction Network (Dual‐GAIN) is proposed to integrate visual features and textual features into a heterogeneous graph, jointly modelling the relationship between them. The proposed Dual‐GAIN mainly consists of two components: a dual‐stream feature extractor and a Graph Attention Interaction Network (GAIN). Specifically, the dual‐stream feature extractor is utilised to extract visual features and textual features respectively. Then, visual local features and textual features are treated as nodes to construct a multi‐modal graph. Cosine similarity constrained attention weights are introduced in GAIN, which is designed for cross‐modal interaction and feature fusion on this heterogeneous multi‐modal graph. Experiments on public large‐scale datasets, that is, Market‐1501, CUHK03 labelled, and CUHK03 detected, demonstrate that our method achieves state‐of‐the‐art performance.
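The core idea of cosine similarity constrained attention over a heterogeneous graph of visual and textual nodes can be sketched as below. This is a minimal illustration under assumptions; the attention scoring, projection weights, and aggregation are stand-ins, and the paper's actual GAIN formulation may differ.

```python
import numpy as np

def cosine_constrained_attention(visual_nodes, text_nodes):
    """Sketch: cross-modal graph attention in which raw attention logits
    are modulated by pairwise cosine similarity between node features.
    All design details here are illustrative assumptions."""
    # Treat visual local features and textual features as nodes of one graph
    nodes = np.vstack([visual_nodes, text_nodes])
    # Pairwise cosine similarity between all nodes
    norm = np.linalg.norm(nodes, axis=1, keepdims=True)
    unit = nodes / np.clip(norm, 1e-8, None)
    cos_sim = unit @ unit.T
    # Bilinear attention logits; random W stands in for a learned projection
    rng = np.random.default_rng(0)
    W = rng.standard_normal((nodes.shape[1], nodes.shape[1])) * 0.1
    logits = nodes @ W @ nodes.T
    # Constrain logits by cosine similarity, then row-wise softmax
    scores = logits * cos_sim
    scores -= scores.max(axis=1, keepdims=True)
    attn = np.exp(scores)
    attn /= attn.sum(axis=1, keepdims=True)
    # Fused node features: attention-weighted aggregation over all nodes
    return attn @ nodes

fused = cosine_constrained_attention(
    np.random.default_rng(1).standard_normal((4, 8)),  # 4 visual local features
    np.random.default_rng(2).standard_normal((2, 8)),  # 2 textual features
)
print(fused.shape)  # (6, 8)
```

Modulating the learned logits by cosine similarity biases each node toward aggregating from semantically similar nodes in either modality, which is one plausible reading of "cosine similarity constrained attention weights".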
Funder
National Natural Science Foundation of China
Publisher
Institution of Engineering and Technology (IET)
Subject
Computer Vision and Pattern Recognition,Software
Cited by
1 article.