Affiliation:
1. Indian Institute of Technology Tirupati
Abstract
Abstract
Occlusionis a frequent phenomenon that hinders the task of visual object tracking. Since occlusion can be from any object and in any shape, data augmentation techniques will not greatly help identify or mitigate the tracker loss. Some of the existing works deal with occlusion only in an unsupervised manner. This paper proposes a generic deep learning framework for identifying occlusion in a given frame by formulating it as a supervised classification task for the first time. The proposed architecture introduces an ''occlusion classification" branch into supervised trackers. This branch helps in the effective learning of features and also provides occlusion status for each frame. A metric is proposed to measure the performance of trackers under occlusion at frame level. The efficacy of the proposed framework is demonstrated on two supervised tracking paradigms: One is from the most commonly used Siamese region proposal class of trackers, and another from the emerging Transformer-based trackers. This framework is tested on six diverse datasets (GOT-10k, LaSOT, OTB2015, TrackingNet, UAV123, and VOT2018), and it achieved significant improvements in performance over the corresponding baselines while performing on par with the state-of-the-art trackers. The contributions in this work are more generic, as any supervised tracker can easily adopt them.
Publisher
Research Square Platform LLC
Reference82 articles.
1. Ghsoun zin. {Generative Adversarial Networks for Online Visual Object T ersarial Networks for Online Visual Object Tracking Systems}. 2019, Wilfrid Laurier University
2. Gupta, Deepak K and Gavves, Efstratios and Smeulders, Arnold W M. {Tackling Occlusion in Siamese Tracking with Structured Dropouts}. 2020, cs.CV/2006.16571, arXiv
3. Yin, Yingjie and Zhang, Lei and Xu, De and Wang, Xingang. {Adversarial Feature Sampling Learning for Efficient Visual Tracking}. 2018, cs.CV/1809.04741, arXiv
4. Xin Wang and Zhiqiang Hou and Wangsheng Yu and Lei Pu and Zefenfen Jin and Xianxiang Qin (2018) Robust occlusion aware part based visual tracking with object scale adaptation. Patt. Reco. 81
5. Bertinetto, Luca and Valmadre, Jack and Henriques, Jo{\ {a}}o F and Vedaldi, Andrea and Torr, Philip H S (2016) {Fully-Convolutional Siamese Networks for Object Tracking}. arXiv preprint arXiv:1606.09549