Fully Self-Supervised Out-of-Domain Few-Shot Learning with Masked Autoencoders
-
Published:2024-01-16
Issue:1
Volume:10
Page:23
-
ISSN:2313-433X
-
Container-title:Journal of Imaging
-
language:en
-
Short-container-title:J. Imaging
Author:
Walsh Reece1, Osman Islam1, Abdelaziz Omar1ORCID, Shehata Mohamed S.1
Affiliation:
1. Irving K. Barber Faculty of Science, University of British Columbia, Kelowna, BC V1V 1V7, Canada
Abstract
Few-shot learning aims to identify unseen classes with limited labelled data. Recent few-shot learning techniques have shown success in generalizing to unseen classes; however, the performance of these techniques has also been shown to degrade when tested on an out-of-domain setting. Previous work, additionally, has also demonstrated increasing reliance on supervised finetuning in an off-line or online capacity. This paper proposes a novel, fully self-supervised few-shot learning technique (FSS) that utilizes a vision transformer and masked autoencoder. The proposed technique can generalize to out-of-domain classes by finetuning the model in a fully self-supervised method for each episode. We evaluate the proposed technique using three datasets (all out-of-domain). As such, our results show that FSS has an accuracy gain of 1.05%, 0.12%, and 1.28% on the ISIC, EuroSat, and BCCD datasets, respectively, without the use of supervised training.
Funder
Natural Sciences and Engineering Research Council of Canada
Reference50 articles.
1. Bateni, P., Goyal, R., Masrani, V., Wood, F., and Sigal, L. (2020, January 14–19). Improved few-shot visual classification. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA. 2. Hu, S.X., Li, D., Stühmer, J., Kim, M., and Hospedales, T.M. (2022, January 18–24). Pushing the limits of simple pipelines for few-shot learning: External data and fine-tuning make a difference. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA. 3. Rodríguez, P., Laradji, I., Drouin, A., and Lacoste, A. (2020, January 23–28). Embedding propagation: Smoother manifold for few-shot classification. Proceedings of the Computer Vision—ECCV 2020: 16th European Conference, Glasgow, UK. 4. Prototypical networks for few-shot learning;Snell;Adv. Neural Inf. Process. Syst.,2017 5. Munkhdalai, T., Yuan, X., Mehri, S., and Trischler, A. (2018, January 10–15). Rapid adaptation with conditionally shifted neurons. Proceedings of the International Conference on Machine Learning, Stockholm, Sweden.
|
|