FQTrack:Object Tracking Method Based on a Feature-Enhanced Memory Network and Memory Quality Selection Mechanism
-
Published:2024-08-14
Issue:16
Volume:13
Page:3221
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Zhang Jianwei1, Zhang Mengya1, Zhang Huanlong2, Cai Zengyu3, Zhu Liang3ORCID
Affiliation:
1. School of Software, Zhengzhou University of Light Industry, Zhengzhou 450000, China 2. School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450000, China 3. School of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450000, China
Abstract
Visual object tracking technology is widely used in intelligent security, automatic driving and other fields, and also plays an important role in frontier fields such as human–computer interactions and virtual reality. The memory network improves the stability and accuracy of tracking by using historical frame information to assist in the positioning of the current frame in object tracking. However, the memory network is still insufficient in feature mining and the accuracy and robustness of the model may be reduced when using noisy observation samples to update it. In view of the above problems, we propose a new tracking framework, which uses the attention mechanism to establish a feature-enhanced memory network and combines cross-attention to aggregate the spatial and temporal context information of the target. The former introduces spatio-temporal adaptive attention and cross-spatial attention, embeds spatial location information into channels, realizes multi-scale feature fusion, dynamically emphasizes target location information, and obtains richer feature maps. The latter guides the tracker to focus on the area with the largest amount of information in the current frame to better distinguish the foreground and background. In addition, through the memory quality selection mechanism, the accuracy and richness of the feature samples are improved, thereby enhancing the adaptability and discrimination ability of the tracking model. Experiments on benchmark test sets such as OTB2015, TrackingNet, GOT-10k, LaSOT and UAV 123 show that this method achieves comparable performance with advanced trackers.
Funder
National Natural Science Foundation of China Key Research and Development Special Project of Henan Province Key Technologies R&D Program of Henan Province
Reference57 articles.
1. Adaptive region proposal with channel regularization for robust object tracking;Lu;IEEE Trans. Circuits Syst. Video Technol.,2021 2. Cor-relation filter tracking via distractor-aware learning and multi-anchor detection;Chen;IEEE Trans. Circuits Syst. Video Technol.,2020 3. Xie, F., Wang, C., Wang, G., Cao, Y., Yang, W., and Zeng, W. (2022, January 19–24). Correlation-Aware Deep Tracking. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA. 4. Kadam, P., Fang, G., and Zou, J.J. (2024). Object Tracking Using Computer Vision: A Review. Computers, 13. 5. Guo, Q., Feng, W., Zhou, C., Huang, R., Wan, L., and Wang, S. (2017, January 22–29). Learning Dynamic Siamese Network for Visual Object Tracking. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy.
|
|