1. Distilling the knowledge in a neural network; Hinton, 2015
2. FitNets: Hints for thin deep nets; Romero
3. A closer look at model adaptation using feature distortion and simplicity bias; Trivedi
4. Fine-tuning can distort pretrained features and underperform out-of-distribution; Kumar
5. A Foundation Model for Music Informatics