1. Marcin Andrychowicz, Misha Denil, Sergio Gómez Colmenarejo, Matthew W. Hoffman, David Pfau, Tom Schaul, Brendan Shillingford, and Nando de Freitas. 2016. Learning to learn by gradient descent by gradient descent. In Proceedings of the 30th International Conference on Neural Information Processing Systems (NIPS’16), Curran Associates Inc., Barcelona, Spain, 3988–3996.
2. Brown T Mann B Ryder N Subbiah M Kaplan JD Dhariwal P Neelakantan A Shyam P Sastry G Askell A others. 2020. Language models are few-shot learners. Advances in neural information processing systems 33:1877–1901
3. Baby steps towards few-shot learning with multiple semantics
4. Liu Q Zhang Y. 2020. Using sensory time-cue to enable unsupervised multimodal meta-learning. arXiv preprint arXiv:200907879
5. Nortje L Kamper H. 2020. Direct multimodal few-shot learning of speech and images. arXiv preprint arXiv:201205680