Progressively Hybrid Transformer for Multi-Modal Vehicle Re-Identification
Authors:
Pan Wenjie (1), Huang Linhan (1), Liang Jianbao (1), Hong Lan (1), Zhu Jianqing (1, 2)
Affiliations:
1. College of Engineering, Huaqiao University, Quanzhou 362021, China
2. Xiamen Yealink Network Technology Company Limited, No. 666, Hu’an Road, High-Tech Park, Huli District, Xiamen 361015, China
Abstract
Multi-modal (i.e., visible, near-infrared, and thermal-infrared) vehicle re-identification has good potential for searching for vehicles of interest under low illumination. However, because different modalities have different imaging characteristics, properly fusing their complementary information is crucial to multi-modal vehicle re-identification. To that end, this paper proposes a progressively hybrid transformer (PHT). The PHT method consists of two parts: random hybrid augmentation (RHA) and a feature hybrid mechanism (FHM). For RHA, an image random cropper and a local region hybrider are designed. The image random cropper simultaneously crops the multi-modal images at random positions, in random numbers, and with random sizes and random aspect ratios to generate local regions. The local region hybrider fuses the cropped regions so that the regions of each modality carry the local structural characteristics of all modalities, mitigating modal differences at the beginning of feature learning. For the FHM, a modal-specific controller and a modal information embedding are designed to effectively fuse multi-modal information at the feature level. Experimental results show that the proposed method outperforms the state-of-the-art method by 2.7% mAP on RGBNT100 and by 6.6% mAP on RGBN300, demonstrating that it can learn multi-modal complementary information effectively.
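The RHA step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the fusion rule, the sampling ranges, or any function names, so everything below is an assumption made for illustration: the helpers `random_boxes` and `hybrid_regions` are hypothetical, the fusion is a plain pixel-wise mean over modalities, the `scale` and `ratio` ranges are arbitrary, and images are single-channel nested lists rather than tensors. It is not the authors' implementation.

```python
import random


def random_boxes(height, width, rng, max_boxes=4, scale=(0.1, 0.4), ratio=(0.5, 2.0)):
    """Sample a random number of boxes with random position, size, and aspect ratio.

    The ranges (max_boxes, scale, ratio) are illustrative assumptions.
    """
    boxes = []
    for _ in range(rng.randint(1, max_boxes)):
        area = rng.uniform(*scale) * height * width
        aspect = rng.uniform(*ratio)  # box height divided by box width
        box_h = max(1, min(height, round((area * aspect) ** 0.5)))
        box_w = max(1, min(width, round((area / aspect) ** 0.5)))
        top = rng.randint(0, height - box_h)
        left = rng.randint(0, width - box_w)
        boxes.append((top, left, box_h, box_w))
    return boxes


def hybrid_regions(images, boxes):
    """Fuse the same boxes across all modalities.

    Assumed fusion rule: inside each box, every modality receives the
    pixel-wise mean of the original modalities. Pixels outside the boxes
    are left unchanged.
    """
    fused = [[row[:] for row in img] for img in images]
    for top, left, box_h, box_w in boxes:
        for y in range(top, top + box_h):
            for x in range(left, left + box_w):
                mean = sum(img[y][x] for img in images) / len(images)
                for out in fused:
                    out[y][x] = mean
    return fused


# Toy aligned images for the three modalities (constant intensities).
rng = random.Random(0)
height, width = 8, 6
visible = [[1.0] * width for _ in range(height)]
near_ir = [[2.0] * width for _ in range(height)]
thermal = [[3.0] * width for _ in range(height)]

# The same boxes are applied to all modalities, mirroring the
# "simultaneous" cropping described in the abstract.
boxes = random_boxes(height, width, rng)
fused = hybrid_regions([visible, near_ir, thermal], boxes)
```

Sampling the boxes once and applying them to every modality keeps the cropped regions spatially aligned, which is what lets each modality receive local structure from the others. Other fusion rules (for example, swapping or weighted mixing of regions) would fit the same interface.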
Funders
National Natural Science Foundation of China; Natural Science Foundation for Outstanding Young Scholars of Fujian Province; Collaborative Innovation Platform Project of Fuzhou-Xiamen-Quanzhou National Independent Innovation Demonstration Zone
Subjects
Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry
Cited by
5 articles.