1. Song Han et al. 2015. Deep compression: Compressing deep neural networks with pruning trained quantization and huffman coding. arXiv preprint arXiv:1510.00149 (2015). Song Han et al. 2015. Deep compression: Compressing deep neural networks with pruning trained quantization and huffman coding. arXiv preprint arXiv:1510.00149 (2015).
2. Song Han et al. 2015. Learning both weights and connections for efficient neural network. In Advances in neural information processing systems. 1135--1143. Song Han et al. 2015. Learning both weights and connections for efficient neural network. In Advances in neural information processing systems. 1135--1143.
3. EIE: Efficient Inference Engine on Compressed Deep Neural Network
4. ESE
5. Quantized neural networks: Training neural networks with low precision weights and activations;Itay Hubara;The Journal of Machine Learning Research,2017