Abstract
Finding a template in a search image is an important task underlying many computer vision applications. This is typically solved by calculating a similarity map using features extracted from the separate images. Recent approaches perform template matching in a deep feature space, produced by a convolutional neural network (CNN), which is found to provide more tolerance to changes in appearance. Inspired by these findings, in this article we investigate whether enhancing the CNN’s encoding of shape information can produce more distinguishable features that improve the performance of template matching. By comparing features from the same CNN trained using different shape–texture training methods, we determined a feature space which improves the performance of most template matching algorithms. When combining the proposed method with the Divisive Input Modulation (DIM) template matching algorithm, its performance is greatly improved, and the resulting method produces state-of-the-art results on a standard benchmark. To confirm these results, we create a new benchmark and show that the proposed method outperforms existing techniques on this new dataset.
Funder
China Scholarship Council
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference41 articles.
1. Explaining away results in more robust visual tracking
2. More robust object tracking via shape and motion cue integration
3. Object recognition by template matching using correlations and phase angle method;Ahuja;Int. J. Adv. Res. Comput. Commun. Eng.,2013
4. R-fcn: Object detection via region-based fully convolutional networks;Dai;Proceedings of the Advances in Neural Information Processing Systems,2016
5. A taxonomy and evaluation of dense two-frame stereo correspondence algorithms;Scharstein;Int. J. Comput. Vis.,2002
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献