Real-Time RGBT Target Tracking Based on Attention Mechanism

Author:

Zhao Qian1,Liu Jun2,Wang Junjia1,Xiong Xingzhong3

Affiliation:

1. School of Automation and Information Engineering, Sichuan University of Science and Engineering, Yibin 644000, China

2. Key Laboratory of Higher Education of Sichuan Province for Enterprise Informationalization and Internet of Things, Sichuan University of Science and Engineering, Yibin 644000, China

3. Artificial Intelligence Key Laboratory of Sichuan Province, Sichuan University of Science and Engineering, Yibin 644000, China

Abstract

The fusion tracking of RGB and thermal infrared image (RGBT) has attracted widespread interest within target tracking by leveraging the complementing benefits of information from both visible and thermal infrared modalities, but achieving robustness while operating in real time remains a challenge. Aimed at this problem, this paper proposes a real-time tracking network based on the attention mechanism, which can improve the tracking speed with a smaller model, and at the same time, introduce the attention mechanism in the module to strengthen the attention to the important features, which can guarantee a certain tracking accuracy. Specifically, the modal features of visible and thermal infrared are extracted separately by using the backbone of the dual-stream structure; then, the important features in the two modes are selected and enhanced by using the channel attention mechanism in the feature selection enhancement module (FSEM) and the Transformer, while noise is reduced by using gating circuits. Finally, the final enhancement fusion is performed by using the spatial channel adaptive adjustment fusion module (SCAAM) in both the spatial and channel dimensions. The PR/SR of the proposed algorithm tested on the GTOT, RGBT234 and LasHeR datasets are 90.0%/73.0%, 84.4%/60.2%, and 46.8%/34.3%, respectively, and generally good tracking accuracy has been achieved, with a speed of up to 32.3067 fps, meeting the model’s real-time requirement.

Funder

Research on key technology of intelligent information processing of energy internet for resilient city evaluation

Opening Project of Key Laboratory of Higher Education of Sichuan Province for Enterprise Informationalization and Internet of Things

Publisher

MDPI AG

Reference60 articles.

1. Tang, Z., Xu, T., and Wu, X.-J. (2022). A survey for deep rgbt tracking. arXiv.

2. Thermal Infrared Target Tracking: A Comprehensive Review;Yuan;IEEE Trans. Instrum. Meas.,2023

3. Schnelle, S.R., and Chan, A.L. (2011, January 5–8). Enhanced target tracking through infrared-visible image fusion. Proceedings of the 14th International Conference on Information Fusion, Chicago, IL, USA.

4. Fusing concurrent visible and infrared videos for improved tracking performance;Chan;Opt. Eng.,2013

5. DSiamMFT: An RGB-T fusion tracking method via dynamic Siamese networks using multi-layer feature fusion;Zhang;Signal Process. Image Commun.,2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3