1. Channel-wise Mixed-precision Assignment for DNN Inference on Constrained Edge Nodes
2. Memory-Driven Mixed Low Precision Quantization for Enabling Deep Network Inference on Microcontrollers;rusci;Proceedings of Machine Learning and Systems,2020
3. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications;howard;CoRR,2017
4. Trained Quantization Thresholds for Accurate and Efficient Fixed-Point Inference of Deep Neural Networks;jain;Proceedings of Machine Learning and Systems,2020
5. BitPruning: Learning Bitlengths for Aggressive and Accurate Quantization;nikolic;CoRR,2020