FQTrack:Object Tracking Method Based on a Feature-Enhanced Memory Network and Memory Quality Selection Mechanism-Reference-Cited by-同舟云学术

FQTrack:Object Tracking Method Based on a Feature-Enhanced Memory Network and Memory Quality Selection Mechanism

Published:2024-08-14 Issue:16 Volume:13 Page:3221
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Zhang Jianwei¹,Zhang Mengya¹,Zhang Huanlong²,Cai Zengyu³,Zhu Liang³^ORCID

Affiliation:

1. School of Software, Zhengzhou University of Light Industry, Zhengzhou 450000, China

2. School of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450000, China

3. School of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450000, China

Abstract

Visual object tracking technology is widely used in intelligent security, automatic driving and other fields, and also plays an important role in frontier fields such as human–computer interactions and virtual reality. The memory network improves the stability and accuracy of tracking by using historical frame information to assist in the positioning of the current frame in object tracking. However, the memory network is still insufficient in feature mining and the accuracy and robustness of the model may be reduced when using noisy observation samples to update it. In view of the above problems, we propose a new tracking framework, which uses the attention mechanism to establish a feature-enhanced memory network and combines cross-attention to aggregate the spatial and temporal context information of the target. The former introduces spatio-temporal adaptive attention and cross-spatial attention, embeds spatial location information into channels, realizes multi-scale feature fusion, dynamically emphasizes target location information, and obtains richer feature maps. The latter guides the tracker to focus on the area with the largest amount of information in the current frame to better distinguish the foreground and background. In addition, through the memory quality selection mechanism, the accuracy and richness of the feature samples are improved, thereby enhancing the adaptability and discrimination ability of the tracking model. Experiments on benchmark test sets such as OTB2015, TrackingNet, GOT-10k, LaSOT and UAV 123 show that this method achieves comparable performance with advanced trackers.

Funder

National Natural Science Foundation of China

Key Research and Development Special Project of Henan Province

Key Technologies R&D Program of Henan Province

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/16/3221/pdf

Reference57 articles.

1. Adaptive region proposal with channel regularization for robust object tracking;Lu;IEEE Trans. Circuits Syst. Video Technol.,2021

2. Cor-relation filter tracking via distractor-aware learning and multi-anchor detection;Chen;IEEE Trans. Circuits Syst. Video Technol.,2020

3. Xie, F., Wang, C., Wang, G., Cao, Y., Yang, W., and Zeng, W. (2022, January 19–24). Correlation-Aware Deep Tracking. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA.

4. Kadam, P., Fang, G., and Zou, J.J. (2024). Object Tracking Using Computer Vision: A Review. Computers, 13.

5. Guo, Q., Feng, W., Zhou, C., Huang, R., Wan, L., and Wang, S. (2017, January 22–29). Learning Dynamic Siamese Network for Visual Object Tracking. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy.