Affiliation:
1. School of Automation and Information Engineering, Sichuan University of Science and Engineering, Yibin 644000, China
2. Artificial Intelligence Key Laboratory of Sichuan Province, Sichuan University of Science and Engineering, Yibin 644000, China
Abstract
Fusion tracking with RGB and thermal infrared (RGBT) images has attracted wide attention because the two modalities have complementary advantages. Most current algorithms obtain modality weights through attention mechanisms to integrate multi-modal information. They do not fully exploit multi-scale information and ignore the rich contextual information among features, which limits tracking performance to some extent. To address this problem, this work proposes a new multi-scale feature interactive fusion network (MSIFNet) for RGBT tracking. Specifically, we use different convolution branches for multi-scale feature extraction and adaptively aggregate them through a feature selection module. In addition, a Transformer interactive fusion module is proposed to build long-distance dependencies and further enhance semantic representation. Finally, a global feature fusion module is designed to adaptively adjust the global information. Extensive experiments on the publicly available GTOT, RGBT234, and LasHeR datasets show that our algorithm outperforms current mainstream tracking algorithms.
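The idea of extracting features at several scales and aggregating them with adaptive weights can be illustrated with a toy sketch. This is not the MSIFNet implementation: the abstract gives no layer details, so everything here is an assumption made for illustration. The function names (`conv1d_same`, `multi_scale_select`), the use of a 1D signal, the box filters standing in for learned convolution branches, and the mean-absolute-response score standing in for a learned feature selection module are all hypothetical.

```python
import math


def conv1d_same(signal, kernel):
    """Zero-padded 1D cross-correlation that keeps the input length (odd kernels)."""
    pad = len(kernel) // 2
    padded = [0.0] * pad + list(signal) + [0.0] * pad
    return [
        sum(k * padded[i + j] for j, k in enumerate(kernel))
        for i in range(len(signal))
    ]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def multi_scale_select(signal, kernel_sizes=(1, 3, 5)):
    """Run one branch per kernel size, then fuse the branches with softmax weights."""
    # Assumption: box filters of different widths stand in for learned convolutions.
    branches = [conv1d_same(signal, [1.0 / k] * k) for k in kernel_sizes]
    # Assumption: a global descriptor per branch (mean absolute response)
    # stands in for a learned selection layer.
    scores = [sum(abs(v) for v in b) / len(b) for b in branches]
    weights = softmax(scores)
    # Weighted sum of the branch outputs, position by position.
    fused = [
        sum(w * b[i] for w, b in zip(weights, branches))
        for i in range(len(signal))
    ]
    return fused, weights


fused, weights = multi_scale_select([0.0, 0.0, 1.0, 0.0, 0.0])
```

In a real tracker the branches would be learned 2D convolutions over RGB and thermal feature maps, and the weights would come from a trained module rather than a fixed statistic; the sketch only shows the branch-then-weighted-sum pattern.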
Funder
the Science and Technology Department of Sichuan Province
the Postgraduate Innovation Fund Project of Sichuan University of Science and Engineering
the Artificial Intelligence Key Laboratory of Sichuan Province
Subject
Electrical and Electronic Engineering, Biochemistry, Instrumentation, Atomic and Molecular Physics, and Optics, Analytical Chemistry
Cited by
6 articles.