PMG—Pyramidal Multi-Granular Matching for Text-Based Person Re-Identification
-
Published:2023-10-30
Issue:21
Volume:13
Page:11876
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Liu Chao1ORCID, Xue Jingyi2, Wang Zijie2, Zhu Aichun2
Affiliation:
1. School of Intelligent Science and Control Engineering, Jinling Institute of Technology, Nanjing 211199, China 2. School of Computer Science and Technology, Nanjing Tech University, Nanjing 211816, China
Abstract
Given a textual query, text-based person re-identification is supposed to search for the targeted pedestrian images from a large-scale visual database. Due to the inherent heterogeneity between different modalities, it is challenging to measure the cross-modal affinity between visual and textual data. Existing works typically employ single-granular methods to extract local features and align image regions with relevant words/phrases. Nevertheless, the limited robustness of single-granular methods cannot adapt to the imprecision and variances of visual and textual features, which are usually influenced by the background clutter, position transformation, posture diversity, and occlusion in surveillance videos, thereby leading to the deterioration of cross-modal matching accuracy. In this paper, we propose a Pyramidal Multi-Granular matching network (PMG) that incorporates a gradual transition process between the coarsest global information and the finest local information by a coarse-to-fine pyramidal method for multi-granular cross-modal features extraction and affinities learning. For each body part of a pedestrian, PMG is adequate in ensuring the integrity of local information while minimizing the surrounding interference signals at a certain scale and can adapt to capture discriminative signals of different body parts and achieve semantically alignment between image strips with relevant textual descriptions, thus suppressing the variances of feature extraction and improving the robustness of feature matching. Comprehensive experiments are conducted on the CUHK-PEDES and RSTPReid datasets to validate the effectiveness of the proposed method and results show that PMG outperforms state-of-the-art (SOTA) methods significantly and yields competitive accuracy of cross-modal retrieval.
Funder
Future Network Scientific Research Fund Project Postgraduate Research & Practice Innovation Program of Jiangsu Province, China
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference36 articles.
1. Sun, Y., Zheng, L., Yang, Y., Tian, Q., and Wang, S. (2018, January 8–14). Beyond part models: Person retrieval with refined part pooling (and A strong convolutional baseline). Proceedings of the 15th European Conference on Computer Vision (ECCV), Munich, Germany. 2. Deep Representation Learning with Part Loss for Person Re-identification;Yao;IEEE Trans. Image Process.,2019 3. Xiong, M., Gao, Z., Hu, R., Chen, J., He, R., Cai, H., and Peng, T. (2022). A Lightweight Efficient Person Re-Identification Method Based on Multi-Attribute Feature Generation. Appl. Sci., 12. 4. Xie, H., Luo, H., Gu, J., and Jiang, W. (2022). Unsupervised Domain Adaptive Person Re-Identification via Intermediate Domains. Appl. Sci., 12. 5. Wang, C., Zhang, C., Feng, Y., Ji, Y., and Ding, J. (2022). Learning Visible Thermal Person Re-identification via Spatial Dependence and Dual-constraint Loss. Entropy, 24.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|