Spatial and Temporal Mutual Promotion for Video-Based Person Re-Identification-Reference-Cited by-同舟云学术

Spatial and Temporal Mutual Promotion for Video-Based Person Re-Identification

Published:2019-07-17 Issue: Volume:33 Page:8786-8793
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Liu Yiheng,Yuan Zhenxun,Zhou Wengang,Li Houqiang

Abstract

Video-based person re-identification is a crucial task of matching video sequences of a person across multiple camera views. Generally, features directly extracted from a single frame suffer from occlusion, blur, illumination and posture changes. This leads to false activation or missing activation in some regions, which corrupts the appearance and motion representation. How to explore the abundant spatial-temporal information in video sequences is the key to solve this problem. To this end, we propose a Refining Recurrent Unit (RRU) that recovers the missing parts and suppresses noisy parts of the current frame’s features by referring historical frames. With RRU, the quality of each frame’s appearance representation is improved. Then we use the Spatial-Temporal clues Integration Module (STIM) to mine the spatial-temporal information from those upgraded features. Meanwhile, the multilevel training objective is used to enhance the capability of RRU and STIM. Through the cooperation of those modules, the spatial and temporal features mutually promote each other and the final spatial-temporal feature representation is more discriminative and robust. Extensive experiments are conducted on three challenging datasets, i.e., iLIDS-VID, PRID-2011 and MARS. The experimental results demonstrate that our approach outperforms existing state-of-the-art methods of video-based person re-identification on iLIDS-VID and MARS and achieves favorable results on PRID-2011.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 47 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Video Is Worth Three Views: Trigeminal Transformers for Video-Based Person Re-Identification;IEEE Transactions on Intelligent Transportation Systems;2024-09

2. Multi-scale spatio-temporal feature adaptive aggregation for video-based Person Re-identification;Knowledge-Based Systems;2024-09

3. Situational diversity in video person re-identification: introducing MSA-BUPT dataset;Complex & Intelligent Systems;2024-05-23

4. Progressive spatial–temporal transfer model for unsupervised person re-identification;International Journal of Multimedia Information Retrieval;2024-04-03

5. AA-RGTCN: reciprocal global temporal convolution network with adaptive alignment for video-based person re-identification;Frontiers in Neuroscience;2024-03-25