Affiliation:
1. College of Computer and Information Science, Chongqing Normal University, Chongqing 401331, China
2. School of Artificial Intelligence, Chongqing University of Technology, Chongqing 401135, China
Abstract
Few-shot learning refers to training a model with a few labeled data to effectively recognize unseen categories. Recently, numerous approaches have been suggested to improve the extraction of abundant feature information at hierarchical layers or multiple scales for similarity metrics, especially methods based on learnable relation networks, which have demonstrated promising results. However, the roles played by image features in relationship measurement vary at different layers, and effectively integrating features from different layers and multiple scales can improve the measurement capacity of the model. In light of this, we propose a novel method called dual-branch multi-scale relation networks with tutoring learning (DbMRNT) for few-shot learning. Specifically, we first generate deep multiple features using a multi-scale feature generator in Branch 1 while extracting features at hierarchical layers in Branch 2. Then, learnable relation networks are employed in both branches to measure the pairwise similarity of features at each scale or layer. Furthermore, to leverage the dominant role of deep features in the final classification, we introduce a tutorial learning module that enables Branch 1 to tutor the learning process of Branch 2. Ultimately, the relation scores of all scales and layers are integrated to obtain the classification results. Extensive experiments on popular few-shot learning datasets prove that our method outperforms other similar methods.
Funder
Chongqing Science and Technology Commission
Chongqing University of Technology graduate education high-quality development project
Chongqing University of Technology First-class undergraduate project
Chongqing University of Technology undergraduate education and teaching reform research project
Chongqing University of Technology—Chongqing LINGLUE Technology Co., LTD. Electronic Information (artificial intelligence) graduate joint training base
Postgraduate Education and Teaching Reform Research Project in Chongqing
Chongqing University of Technology—CISDI Chongqing Information Technology Co., LTD. Computer Technology graduate joint training base
Reference59 articles.
1. He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27–30). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.
2. ImageNet Classification with Deep Convolutional Neural Networks;Krizhevsky;Commun. ACM,2017
3. Simonyan, K., and Zisserman, A. (2015, January 7–9). Very deep convolutional networks for large-scale image recognition. Proceedings of the International Conference on Learning Representations, San Diego, CA, USA.
4. Hu, J., Shen, L., Albanie, S., Lin, Z., and Liu, J. (2018, January 18–23). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.
5. Recognition-by-components: A theory of human image understanding;Biederman;Psychol. Rev.,1987