Affiliation:
1. College of Information Science and Engineering, Northeastern University, Shenyang 110819, China
Abstract
The main task in visual object tracking is to track a moving object in an image sequence. In this process, the object’s trajectory and behavior can be described by calculating the object’s position, velocity, acceleration, and other parameters or by memorizing the position of the object in each frame of the corresponding video. Therefore, visual object tracking can complete many more advanced tasks, has great performance in relation to real scenes, and is widely used in automated driving, traffic monitoring, human–computer interaction, and so on. Siamese-network-based trackers have been receiving a great deal of attention from the tracking community, but they have many drawbacks. This paper analyzes the shortcomings of the Siamese network tracker in detail, uses the method of feature multi-scale fusion to improve the Siamese network tracker, and proposes a new target-tracking framework to address its shortcomings. In this paper, a feature map with low-resolution but strong semantic information and a feature map with high-resolution and rich spatial information are integrated to improve the model’s ability to depict an object, and the problem of scale change is solved by fusing features at different scales. Furthermore, we utilize the 3D Max Filtering module to suppress repeated predictions of features at different scales. Finally, our experiments conducted on the four tracking benchmarks OTB2015, VOT2016, VOT2018, and GOT10K show that the proposed algorithm effectively improves the tracking accuracy and robustness of the system.
Funder
National Key Research and Development Program of China
Fundamental Research Funds for the Central Universities
Subject
Information Systems and Management,Computer Networks and Communications,Modeling and Simulation,Control and Systems Engineering,Software
Reference50 articles.
1. Real time object detection and tracking system for video surveillance system;Jha;Multimed. Tools Appl.,2021
2. Detection and Tracking of Moving Objects at Road Intersections Using a 360-Degree Camera for Driver Assistance and Automated Driving;Premachandra;IEEE Access,2020
3. Human–Computer Interaction Based Visual Feedback System for Augmentative and Alternative Communication;Liu;Int. J. Speech Technol.,2022
4. A Survey of Single Object Tracking Algorithms Based on Deep Learning;Wang;Comput. Syst. Appl.,2022
5. A Survey of Object Tracking Algorithms;Meng;IEEE/CAA J. Autom. Sin.,2019