1. Few-shot learning with metric-agnostic conditional embeddings;hilliard,2018
2. Like what you like: Knowledge distill via neuron selectivity transfer;huang,2017
3. Distilling the knowledge in a neural network;hinton,2015
4. Meta-dmoe: Adapting to domain shift by meta-distillation from mixture-of-experts;zhong;Neural Information Processing System,2022