Affiliation:
1. School of Mechanical and Electrical Engineering, China Jiliang University, Hangzhou 310018, China
Abstract
Person re-identification aims to identify the same pedestrians captured by various cameras from different viewpoints in multiple scenarios. Occlusion is the toughest problem for practical applications. In video-based ReID tasks, motion information can be easily obtained from sampled frames, and provide discriminative human part representations. However, most motion-based methodologies are designed for video frames which are not suitable for processing single static image input. In this paper, we propose a Motion-Aware Fusion (MAF) network, aiming to acquire motion information from static images in order to improve the performance of ReID tasks. Specifically, a visual adapter is introduced to enable visual feature extraction, either from image or video data. We design a motion consistency task to guide the motion-aware transformer to learn representative human-part motion information and greatly improve the learning quality of features of occluded pedestrians. Extensive experiments on popular holistic, occluded, and video datasets demonstrate the effectiveness of our proposed method. This method outperforms state-of-the-art approaches by improving the mean average precision (mAP) by 1.5% and rank-1 accuracy by 1.2% on the challenging Occluded-REID dataset. At the same time, it surpasses other methods on the MARS dataset with an improvement of 0.2% in mAP and 0.1% in rank-1 accuracy.
Reference51 articles.
1. Yang, Y., Yang, J., Yan, J., Liao, S., Yi, D., and Li, S.Z. (2014, January 6–12). Salient color names for person re-identification. Proceedings of the ECCV, Zurich, Switzerland.
2. Liao, S., Hu, Y., Zhu, X., and Li, S.Z. (2015, January 1–12). Person re-identification by local maximal occurrence representation and metric learning. Proceedings of the CVPR, Boston, MA, USA.
3. Reidentification by relative distance comparison;Zheng;IEEE Trans. Pattern Anal. Mach. Intell.,2012
4. Robust structural sparse tracking;Zhang;IEEE Trans. Pattern Anal. Mach. Intell.,2018
5. Learning multi-task correlation particle filters for visual tracking;Zhang;IEEE Trans. Pattern Anal. Mach. Intell.,2018