Affiliation:
1. School of Automation and Information Engineering, Sichuan University of Science and Engineering, Yibin 644000, China
2. Key Laboratory of Higher Education of Sichuan Province for Enterprise Informationalization and Internet of Things, Sichuan University of Science and Engineering, Yibin 644000, China
3. Artificial Intelligence Key Laboratory of Sichuan Province, Sichuan University of Science and Engineering, Yibin 644000, China
Abstract
The fusion tracking of RGB and thermal infrared image (RGBT) has attracted widespread interest within target tracking by leveraging the complementing benefits of information from both visible and thermal infrared modalities, but achieving robustness while operating in real time remains a challenge. Aimed at this problem, this paper proposes a real-time tracking network based on the attention mechanism, which can improve the tracking speed with a smaller model, and at the same time, introduce the attention mechanism in the module to strengthen the attention to the important features, which can guarantee a certain tracking accuracy. Specifically, the modal features of visible and thermal infrared are extracted separately by using the backbone of the dual-stream structure; then, the important features in the two modes are selected and enhanced by using the channel attention mechanism in the feature selection enhancement module (FSEM) and the Transformer, while noise is reduced by using gating circuits. Finally, the final enhancement fusion is performed by using the spatial channel adaptive adjustment fusion module (SCAAM) in both the spatial and channel dimensions. The PR/SR of the proposed algorithm tested on the GTOT, RGBT234 and LasHeR datasets are 90.0%/73.0%, 84.4%/60.2%, and 46.8%/34.3%, respectively, and generally good tracking accuracy has been achieved, with a speed of up to 32.3067 fps, meeting the model’s real-time requirement.
Funder
Research on key technology of intelligent information processing of energy internet for resilient city evaluation
Opening Project of Key Laboratory of Higher Education of Sichuan Province for Enterprise Informationalization and Internet of Things
Reference60 articles.
1. Tang, Z., Xu, T., and Wu, X.-J. (2022). A survey for deep rgbt tracking. arXiv.
2. Thermal Infrared Target Tracking: A Comprehensive Review;Yuan;IEEE Trans. Instrum. Meas.,2023
3. Schnelle, S.R., and Chan, A.L. (2011, January 5–8). Enhanced target tracking through infrared-visible image fusion. Proceedings of the 14th International Conference on Information Fusion, Chicago, IL, USA.
4. Fusing concurrent visible and infrared videos for improved tracking performance;Chan;Opt. Eng.,2013
5. DSiamMFT: An RGB-T fusion tracking method via dynamic Siamese networks using multi-layer feature fusion;Zhang;Signal Process. Image Commun.,2020