Scaling-Invariant Max-Filtering Enhancement Transformers for Efficient Visual Tracking
-
Published:2023-09-15
Issue:18
Volume:12
Page:3905
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Chen Zhen1ORCID, Xiong Xingzhong2, Meng Fanqin2, Xiao Xianbing1ORCID, Liu Jun23
Affiliation:
1. School of Automation and Information Engineering, Sichuan University of Science and Engineering, Yibin 644000, China 2. Artificial Intelligence Key Laboratory of Sichuan Province, Sichuan University of Science and Engineering, Yibin 644000, China 3. Key Laboratory of Higher Education of Sichuan Province for Enterprise Informationalization and Internet of Things, Sichuan University of Science and Engineering, Yibin 644000, China
Abstract
Real-time tracking is one of the most challenging problems in computer vision. Most Transformer-based trackers usually require expensive computational and storage power, which leads to these robust trackers being unable to achieve satisfactory real-time performance in resource-constrained devices. In this work, we propose a lightweight tracker, AnteaTrack. To localize the target more accurately, this paper presents a scaling-invariant max-filtering operator. It uses local max-pooling to filter the suspected target portion in overlapping sliding windows for enhancement while suppressing the background. For a more compact target bounding-box, this paper presents an upsampling module based on Pixel-Shuffle to increase the fine-grained expression of target features. In addition, AnteaTrack can run in real time at 47 frames per second (FPS) on a CPU. We tested AnteaTrack on five datasets, and a large number of experiments showed that AnteaTrack provides the most efficient solution compared to the same type of CPU real-time trackers.
Funder
Science and Technology Department of Sichuan Province Opening Project of the Key Laboratory of Higher Education of Sichuan Province for Enterprise Informationalization and Internet of Things Postgraduate Innovation Fund Project of Sichuan University of Science and Engineering Artificial Intelligence Key Laboratory of Sichuan Province
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference50 articles.
1. High-Speed Tracking with Kernelized Correlation Filters;Henriques;IEEE Trans. Pattern Anal. Mach. Intell.,2015 2. Danelljan, M., Bhat, G., Shahbaz Khan, F., and Felsberg, M. (2017, January 21–26). ECO: Efficient Convolution Operators for Tracking. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA. 3. Hua, G., and Jégou, H. (2016, January 8–16). Fully-Convolutional Siamese Networks for Object Tracking. Proceedings of the Computer Vision—ECCV 2016 Workshops, Amsterdam, The Netherlands. Lecture Notes in Computer Science. 4. Li, B., Yan, J., Wu, W., Zhu, Z., and Hu, X. (2018, January 18–22). High Performance Visual Tracking with Siamese Region Proposal Network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA. 5. Li, B., Wu, W., Wang, Q., Zhang, F., Xing, J., and Yan, J. (2019, January 15–20). SiamRPN++: Evolution of Siamese Visual Tracking with Very Deep Networks. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|