Author:
Wang Dong,Wang Qi,Min Weidong,Gai Di,Han Qing,Li Longfei,Geng Yuhan
Abstract
AbstractDistinguishing identity-unrelated background information from discriminative identity information poses a challenge in unsupervised vehicle re-identification (Re-ID). Re-ID models suffer from varying degrees of background interference caused by continuous scene variations. The recently proposed segment anything model (SAM) has demonstrated exceptional performance in zero-shot segmentation tasks. The combination of SAM and vehicle Re-ID models can achieve efficient separation of vehicle identity and background information. This paper proposes a method that combines SAM-driven mask autoencoder (MAE) pre-training and background-aware meta-learning for unsupervised vehicle Re-ID. The method consists of three sub-modules. First, the segmentation capacity of SAM is utilized to separate the vehicle identity region from the background. SAM cannot be robustly employed in exceptional situations, such as those with ambiguity or occlusion. Thus, in vehicle Re-ID downstream tasks, a spatially-constrained vehicle background segmentation method is presented to obtain accurate background segmentation results. Second, SAM-driven MAE pre-training utilizes the aforementioned segmentation results to select patches belonging to the vehicle and to mask other patches, allowing MAE to learn identity-sensitive features in a self-supervised manner. Finally, we present a background-aware meta-learning method to fit varying degrees of background interference in different scenarios by combining different background region ratios. Our experiments demonstrate that the proposed method has state-of-the-art performance in reducing background interference variations.
Publisher
Springer Science and Business Media LLC
Reference59 articles.
1. Lei, J.; Qin, T.; Peng, B.; Li, W.; Pan, Z.; Shen, H.; Kwong, S. Reducing background induced domain shift for adaptive person re-identification. IEEE Transactions on Industrial Informatics Vol. 19, No. 6, 7377–7388, 2023.
2. Zhang, G.; Zhang, H.; Lin, W.; Chandran, A. K.; Jing, X. Camera contrast learning for unsupervised person re-identification. IEEE Transactions on Circuits and Systems for Video Technology Vol. 33, No. 8, 4096–4107, 2023.
3. Zhu, K.; Guo, H.; Liu, S.; Wang, J.; Tang, M. Learning semantics-consistent stripes with self-refinement for person re-identification. IEEE Transactions on Neural Networks and Learning Systems Vol. 34, No. 11, 8531–8542, 2023.
4. Lecture Notes in Computer Science;M Wu,2020
5. Munir, A.; Martinel, N.; Micheloni, C. Oriented splits network to distill background for vehicle reidentification. In: Proceedings of the 17th IEEE International Conference on Advanced Video and Signal Based Surveillance, 1–8, 2021.