Attention and Pixel Matching in RGB-T Object Tracking
-
Published:2023-03-29
Issue:7
Volume:11
Page:1646
-
ISSN:2227-7390
-
Container-title:Mathematics
-
language:en
-
Short-container-title:Mathematics
Author:
Li Da1, Zhang Yao1, Chen Min1, Chai Haoxiang1
Affiliation:
1. School of Information Engineering, Wuhan University of Technology, Wuhan 430070, China
Abstract
Visual object tracking using visible light images and thermal infrared images, named RGB-T tracking, has recently attracted increasing attention in the tracking community. Deep neural network-based methods becoming the most popular RGB-T trackers, still have to balance the robustness and the speed of calculation. A novel tracker with Siamese architecture is proposed to obtain the accurate object location and meet the real-time requirements. Firstly, a multi-modal weight penalty module is designed to assign different weights to the RGB and thermal infrared features. Secondly, a new pixel matching module is proposed to calculate the similarity between each pixel on the search and the template features, which can avoid bringing excessive background information versus the regular cross-correlation operation. Finally, an improved anchor-free bounding box prediction network is put forward to further reduce the interference of the background information. The experimental results on the standard RGB-T tracking benchmark datasets show that the proposed method achieves better precision and success rate with a speed of over 34 frames per second which satisfies the real-time tracking.
Funder
National Natural Science Foundation of China
Subject
General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference39 articles.
1. Visual object tracking: A survey;Chen;Comput. Vis. Image Underst.,2022 2. Object fusion tracking based on visible and infrared images: A comprehensive review;Zhang;Inf. Fusion,2020 3. Nam, H., and Han, B. (July, January 26). Learning multi-domain convolutional neural networks for visual tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA. 4. Li, C., Liu, L., Lu, A., Ji, Q., and Tang, J. (2020, January 23–28). Challenge-aware RGBT tracking. Proceedings of the Computer Vision—ECCV 2020, Proceedings of the 16th European Conference, Glasgow, UK. Part XXII. 5. Wang, C., Xu, C., Cui, Z., Zhou, L., Zhang, T., Zhang, X., and Yang, J. (2020, January 14–19). Cross-modal pattern-propagation for RGB-T tracking. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Online.
|
|