Neuromorphic processor-oriented hybrid Q-format multiplication with adaptive quantization for tiny YOLO3-Reference-Cited by-同舟云学术

Neuromorphic processor-oriented hybrid Q-format multiplication with adaptive quantization for tiny YOLO3

Published:2023-02-13 Issue:15 Volume:35 Page:11013-11041
ISSN:0941-0643
Container-title:Neural Computing and Applications
language:en
Short-container-title:Neural Comput & Applic

Author:

Li Tao^ORCID,Ma Yitao,Endoh Tetsuo

Abstract

AbstractDeep neural networks (DNNs) have delivered unprecedented achievements in the modern Internet of Everything society, encompassing autonomous driving, expert diagnosis, unmanned supermarkets, etc. It continues to be challenging for researchers and engineers to develop a high-performance neuromorphic processor for deployment in edge devices or embedded hardware. DNNs’ superpower derives from their enormous and complex network architecture, which is computation-intensive, time-consuming, and energy-heavy. Due to the limited perceptual capacity of humans, accurate processing results from DNNs require a substantial amount of computing time, making them redundant in some applications. Utilizing adaptive quantization technology to compress the DNN model with sufficient accuracy is crucial for facilitating the deployment of neuromorphic processors in emerging edge applications. This study proposes a method to boost the development of neuromorphic processors by conducting fixed-point multiplication in a hybrid Q-format using an adaptive quantization technique on the convolution of tiny YOLO3. In particular, this work integrates the sign-bit check and bit roundoff techniques into the arithmetic of fixed-point multiplications to address overflow and roundoff issues within the convolution’s adding and multiplying operations. In addition, a hybrid Q-format multiplication module is developed to assess the proposed method from a hardware perspective. The experimental results prove that the hybrid multiplication with adaptive quantization on the tiny YOLO3’s weights and feature maps possesses a lower error rate than alternative fixed-point representation formats while sustaining the same object detection accuracy. Moreover, the fixed-point numbers represented by Q(6.9) have a suboptimal error rate, which can be utilized as an alternative representation form for the tiny YOLO3 algorithm-based neuromorphic processor design. In addition, the 8-bit hybrid Q-format multiplication module exhibits low power consumption and low latency in contrast to benchmark multipliers.

Funder

Japan Society for the Promotion of Science

New Energy and Industrial Technology Development Organization

Center for Innovative Integrated Electronic Systems (CIES) consortium

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Software

Link

https://link.springer.com/content/pdf/10.1007/s00521-023-08280-y.pdf

Reference60 articles.

1. Mead C (1990) Neuromorphic electronic systems. Proc IEEE 78(10):1629–1636

2. Yang S, Wang J, Deng B, Azghadi MR, Linares-Barranco B (2021) Neuromorphic context-dependent learning framework with fault-tolerant spike routing. IEEE Trans Neural Netw Learn Syst. https://doi.org/10.1109/TNNLS.2021.3084250

3. Schuman CD, Kulkarni SR, Parsa M, Mitchell JP, Kay B (2022) Opportunities for neuromorphic computing algorithms and applications. Nat Comput Sci 2(1):10–19

4. Yang S, Gao T, Wang J, Deng B, Azghadi MR, Lei T, Linares-Barranco B (2022) SAM: a unified self-adaptive multicompartmental spiking neuron model for learning With working memory. Front Neurosci 16(850945):1–22

5. Shaban A, Bezugam SS, Suri M (2021) An adaptive threshold neuron for recurrent spiking neural networks with nanodevice hardware implementation. Nat Commun 12(1):1–11