Partial Is Better Than All: Revisiting Fine-tuning Strategy for Few-shot Learning-Reference-Cited by-同舟云学术

Partial Is Better Than All: Revisiting Fine-tuning Strategy for Few-shot Learning

Published:2021-05-18 Issue:11 Volume:35 Page:9594-9602
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Shen Zhiqiang,Liu Zechun,Qin Jie,Savvides Marios,Cheng Kwang-Ting

Abstract

The goal of few-shot learning is to learn a classifier that can recognize unseen classes from limited support data with labels. A common practice for this task is to train a model on the base set first and then transfer to novel classes through fine-tuning or meta-learning. However, as the base classes have no overlap to the novel set, simply transferring whole knowledge from base data is not an optimal solution since some knowledge in the base model may be biased or even harmful to the novel class. In this paper, we propose to transfer partial knowledge by freezing or fine-tuning particular layer(s) in the base model. Specifically, layers will be imposed different learning rates if they are chosen to be fine-tuned, to control the extent of preserved transferability. To determine which layers to be recast and what values of learning rates for them, we introduce an evolutionary search based method that is efficient to simultaneously locate the target layers and determine their individual learning rates. We conduct extensive experiments on CUB and mini-ImageNet to demonstrate the effectiveness of our proposed method. It achieves the state-of-the-art performance on both meta-learning and non-meta based frameworks. Furthermore, we extend our method to the conventional pre-training + fine-tuning paradigm and obtain consistent improvement.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 39 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Edge-labeling based modified gated graph network for few-shot learning;Pattern Recognition;2024-06

2. Local feature semantic alignment network for few-shot image classification;Multimedia Tools and Applications;2024-01-31

3. Few-Shot Classification with Semantic Augmented Activators;Pattern Recognition and Computer Vision;2023-12-29

4. FSCD-Net: A Few-Shot Stego Cross-Domain Net for Image Steganalysis;Pattern Recognition and Computer Vision;2023-12-26

5. Learning and Adapting Diverse Representations for Cross-domain Few-shot Learning;2023 IEEE International Conference on Data Mining Workshops (ICDMW);2023-12-04