AA-RGTCN: reciprocal global temporal convolution network with adaptive alignment for video-based person re-identification

Published: 2024-03-25 Volume: 18
ISSN: 1662-453X
Container-title: Frontiers in Neuroscience
Short-container-title: Front. Neurosci.

Author:

Zhang Yanjun, Lin Yanru, Yang Xu

Abstract

Person re-identification (Re-ID) aims to retrieve pedestrians across different cameras. Compared with image-based Re-ID, video-based Re-ID extracts features from video sequences, which contain both spatial and temporal information. Existing methods usually focus on the most salient image regions, which leads to redundant spatial description and insufficient temporal description. Methods that do take temporal cues into account usually ignore misalignment between frames and consider only a fixed length of a given sequence. In this study, we propose a Reciprocal Global Temporal Convolution Network with Adaptive Alignment (AA-RGTCN). The structure addresses misalignment between frames and models a discriminative temporal representation. Specifically, the Adaptive Alignment block is designed to shift each frame adaptively to its best position for temporal modeling. The Reciprocal Global Temporal Convolution Network then models robust temporal features across different time intervals in both normal and inverted time order. Experimental results show that AA-RGTCN achieves 85.9% mAP and 91.0% Rank-1 on MARS, 90.6% Rank-1 on iLIDS-VID, and 96.6% Rank-1 on PRID-2011, which indicates better performance than other state-of-the-art approaches.
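
The abstract reports results as Rank-1 and mAP. As a reading aid, the sketch below shows how these two retrieval metrics are commonly computed from ranked gallery lists. It is not the authors' evaluation code: the function names (`rank1_and_ap`, `evaluate`) and the toy identity labels are hypothetical, and the sketch assumes a simplified setting that skips the camera-based filtering of gallery entries that Re-ID benchmarks normally apply.

```python
def rank1_and_ap(ranked_ids, query_id):
    """Return (rank-1 hit, average precision) for a single query.

    ranked_ids: gallery identity labels, sorted by decreasing similarity
    to the query. Assumption: no same-camera filtering is applied.
    """
    hits = 0
    precisions = []
    for position, gallery_id in enumerate(ranked_ids, start=1):
        if gallery_id == query_id:
            hits += 1
            # Precision at each position where a correct match appears.
            precisions.append(hits / position)
    if hits == 0:
        raise ValueError("query has no matching gallery entry")
    rank1 = 1.0 if ranked_ids[0] == query_id else 0.0
    average_precision = sum(precisions) / hits
    return rank1, average_precision


def evaluate(queries):
    """Average Rank-1 and AP over (query_id, ranked_ids) pairs."""
    results = [rank1_and_ap(ranked, qid) for qid, ranked in queries]
    rank1 = sum(r for r, _ in results) / len(results)
    mean_ap = sum(ap for _, ap in results) / len(results)
    return rank1, mean_ap


# Hypothetical toy data: two queries with hand-made ranked galleries.
toy_queries = [
    ("A", ["A", "B", "A", "C"]),  # AP = (1/1 + 2/3) / 2 = 5/6
    ("B", ["C", "B", "A", "B"]),  # AP = (1/2 + 2/4) / 2 = 1/2
]
rank1, mean_ap = evaluate(toy_queries)
print(f"Rank-1 = {rank1:.3f}, mAP = {mean_ap:.3f}")
```

In this toy case only the first query has a correct match at the top of its ranking, so Rank-1 is 0.5, while mAP averages the two per-query precisions and also rewards correct matches that appear lower in the list.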

Publisher

Frontiers Media SA

Cited by 1 article.

1. In-Depth Analysis of GAF-Net: Comparative Fusion Approaches in Video-Based Person Re-Identification; Algorithms; 2024-08-11