Dual-Branch Multi-Scale Relation Networks with Tutorial Learning for Few-Shot Learning-Reference-Cited by-同舟云学术

Dual-Branch Multi-Scale Relation Networks with Tutorial Learning for Few-Shot Learning

Published:2024-02-17 Issue:4 Volume:14 Page:1599
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Xu Chuanyun¹²^ORCID,Wang Hang²,Zhang Yang¹,Zhou Zheng²,Li Gang²

Affiliation:

1. College of Computer and Information Science, Chongqing Normal University, Chongqing 401331, China

2. School of Artificial Intelligence, Chongqing University of Technology, Chongqing 401135, China

Abstract

Few-shot learning refers to training a model with a few labeled data to effectively recognize unseen categories. Recently, numerous approaches have been suggested to improve the extraction of abundant feature information at hierarchical layers or multiple scales for similarity metrics, especially methods based on learnable relation networks, which have demonstrated promising results. However, the roles played by image features in relationship measurement vary at different layers, and effectively integrating features from different layers and multiple scales can improve the measurement capacity of the model. In light of this, we propose a novel method called dual-branch multi-scale relation networks with tutoring learning (DbMRNT) for few-shot learning. Specifically, we first generate deep multiple features using a multi-scale feature generator in Branch 1 while extracting features at hierarchical layers in Branch 2. Then, learnable relation networks are employed in both branches to measure the pairwise similarity of features at each scale or layer. Furthermore, to leverage the dominant role of deep features in the final classification, we introduce a tutorial learning module that enables Branch 1 to tutor the learning process of Branch 2. Ultimately, the relation scores of all scales and layers are integrated to obtain the classification results. Extensive experiments on popular few-shot learning datasets prove that our method outperforms other similar methods.

Funder

Chongqing Science and Technology Commission

Chongqing University of Technology graduate education high-quality development project

Chongqing University of Technology First-class undergraduate project

Chongqing University of Technology undergraduate education and teaching reform research project

Chongqing University of Technology—Chongqing LINGLUE Technology Co., LTD. Electronic Information (artificial intelligence) graduate joint training base

Postgraduate Education and Teaching Reform Research Project in Chongqing

Chongqing University of Technology—CISDI Chongqing Information Technology Co., LTD. Computer Technology graduate joint training base

Publisher

MDPI AG

Link

https://www.mdpi.com/2076-3417/14/4/1599/pdf

Reference59 articles.

1. He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27–30). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.

2. ImageNet Classification with Deep Convolutional Neural Networks;Krizhevsky;Commun. ACM,2017

3. Simonyan, K., and Zisserman, A. (2015, January 7–9). Very deep convolutional networks for large-scale image recognition. Proceedings of the International Conference on Learning Representations, San Diego, CA, USA.

4. Hu, J., Shen, L., Albanie, S., Lin, Z., and Liu, J. (2018, January 18–23). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.

5. Recognition-by-components: A theory of human image understanding;Biederman;Psychol. Rev.,1987