Author:
Wang Biyan,Guo Baolong,Li Qianying,Liu Runzhi
Abstract
Abstract
The MDNet algorithm works well on tracking problems of the video sequence, but the speed is very slow. We have made some improvements to accelerate the process of feature extraction. It enhances the expressive ability of the feature map by removing the max pooling layer and using the method of expanding convolution to increase the receptive field of each point on the feature map. In addition, MDNet is building on CNN, and there are problems that similar targets have a large interference to the results. To address this problem, We use RNN to capture the long-term dependence of the target before and after the target data in the sequence data, and introduce the RNN to model the structure information of the target object, and then fuse the RNN feature and CNN feature of the tracked target object. In addition, another new loss term is introduced to make the targets in different domains away from each other in the shared feature space, thereby improving the algorithm’s ability to discriminate similar interferers. Compared with MDNet, our improved algorithm is much faster and the accuracy is improved.
Subject
General Physics and Astronomy
Reference14 articles.
1. Object tracking: A survey[J];Yilmaz;Acm Computing Surveys,2006
2. Backpropagation Applied to Handwritten Zip Code Recognition[J];Lecun;Neural Computation,1989
3. Rich feature hierarchies for object detection and semantic segmentation[C];Girshick,2014
4. ImageNet Classification with Deep Convolutional Neural Networks[J];Krizhevsky;Advances in neural information processing systems,2012
5. HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition[C];Yan,2016
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. An improved MDNet target tracking algorithm;Second International Conference on Digital Signal and Computer Communications (DSCC 2022);2022-08-04