Free Bits: Latency Optimization of Mixed-Precision Quantized Neural Networks on the Edge-Reference-Cited by-同舟云学术

Free Bits: Latency Optimization of Mixed-Precision Quantized Neural Networks on the Edge

Published:2023-06-11 Issue: Volume: Page:
ISSN:
Container-title:2023 IEEE 5th International Conference on Artificial Intelligence Circuits and Systems (AICAS)
language:
Short-container-title:

Author:

Rutishauser Georg¹,Conti Francesco²,Benini Luca¹

Affiliation:

1. ETH Zürich,Departement Informationstechnologie und Elektrotechnik,Switzerland

2. Università di Bologna,Dipartimento di Ingegneria Dell’Energia Elettrica e Dell’Informazione,Bologna,Italy

Funder

Horizon Europe

Publisher

IEEE

Link

http://xplorestaging.ieee.org/ielx7/10168547/10168548/10168577.pdf?arnumber=10168577

Reference18 articles.

1. Channel-wise Mixed-precision Assignment for DNN Inference on Constrained Edge Nodes

2. Memory-Driven Mixed Low Precision Quantization for Enabling Deep Network Inference on Microcontrollers;rusci;Proceedings of Machine Learning and Systems,2020

3. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications;howard;CoRR,2017

4. Trained Quantization Thresholds for Accurate and Efficient Fixed-Point Inference of Deep Neural Networks;jain;Proceedings of Machine Learning and Systems,2020

5. BitPruning: Learning Bitlengths for Aggressive and Accurate Quantization;nikolic;CoRR,2020

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Reducing False Alarms in Wearable Seizure Detection With EEGformer: A Compact Transformer Model for MCUs;IEEE Transactions on Biomedical Circuits and Systems;2024-06

2. Edge Inference with Fully Differentiable Quantized Mixed Precision Neural Networks;2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV);2024-01-03

3. Flexible and Fully Quantized Lightweight TinyissimoYOLO for Ultra-Low-Power Edge Systems;IEEE Access;2024