1. Zhu, M., Gupta, S.: To prune or not to prune: exploring the efficacy of pruning for model compression (2017)
2. Sung, W., Shin, S., Hwang, K.: Resiliency of deep neural networks under quantization (2015)
3. Bucila, C., Caruana, R., Niculescu-Mizil, A.: Model compression, vol. 10(1145), pp. 535–541 (2006)
4. Ba, L.J., Caruana, R.: Do deep nets really need to be deep? Adv. Neural. Inf. Process. Syst. 3, 2654–2662 (2014)
5. Hinton, G., Vinyals, O., Dean, J.: Distilling the knowledge in a neural network (2015)