1. Distilling the knowledge in a neural network;Hinton,2015
2. High performance convolution using sparsity and patterns for inference in deep convolutional neural networks;Amer,2021
3. Towards understanding knowledge distillation;Phuong,2019
4. Bayes conditional distribution estimation for knowledge distillation based on conditional mutual information;Ye,2024
5. A statistical perspective on distillation;Menon,2021