Hardware-Efficient Stochastic Binary CNN Architectures for Near-Sensor Computing

Published: 2022-01-05 Volume: 15
ISSN: 1662-453X
Container-title: Frontiers in Neuroscience
Short-container-title: Front. Neurosci.

Author:

Parmar Vivek, Penkovsky Bogdan, Querlioz Damien, Suri Manan

Abstract

With recent advances in the field of artificial intelligence (AI) such as binarized neural networks (BNNs), a wide variety of vision applications with energy-optimized implementations have become possible at the edge. Such networks have the first layer implemented with high precision, which poses a challenge in deploying a uniform hardware mapping for the network implementation. Stochastic computing can allow conversion of such high-precision computations to a sequence of binarized operations while maintaining equivalent accuracy. In this work, we propose a fully binarized hardware-friendly computation engine based on stochastic computing as a proof of concept for vision applications involving multi-channel inputs. Stochastic sampling is performed by sampling from a non-uniform (normal) distribution based on analog hardware sources. We first validate the benefits of the proposed pipeline on the CIFAR-10 dataset. To further demonstrate its application for real-world scenarios, we present a case study of microscopy image diagnostics for pathogen detection. We then evaluate the benefits of implementing such a pipeline using OxRAM-based circuits for stochastic sampling as well as in-memory computing-based binarized multiplication. The proposed implementation is about 1,000 times more energy efficient compared to conventional floating-precision-based digital implementations, with memory savings of a factor of 45.
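
The idea of replacing a high-precision first layer with a sequence of binarized operations can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `stochastic_binarize` and `binary_dot`, the noise width `sigma`, the stream length, and the toy input are all assumptions made for illustration, and a software random generator stands in for the analog (OxRAM-based) noise source. Each input value is compared against normally distributed thresholds to produce a stream of +1/-1 bits, and the binary dot product is computed as an XNOR followed by a popcount. Because the thresholds are normal rather than uniform, the stream average converges to erf(x / (sigma * sqrt(2))) rather than to x itself.

```python
import math
import numpy as np

def stochastic_binarize(x, n_samples, sigma=1.0, rng=None):
    """Encode real values as n_samples streams of +1/-1 bits.

    Each bit is +1 when the input exceeds a threshold drawn from
    N(0, sigma^2), else -1. (Hypothetical helper, software noise source.)
    """
    rng = np.random.default_rng() if rng is None else rng
    x = np.asarray(x, dtype=np.float64)
    thresholds = rng.normal(0.0, sigma, size=(n_samples,) + x.shape)
    return np.where(x > thresholds, 1, -1).astype(np.int8)

def binary_dot(bits, weights):
    """Dot product of +1/-1 vectors via XNOR and popcount.

    Counting positions where bits and weights agree is the popcount of
    their XNOR; the signed sum is 2 * matches - n.
    """
    matches = np.count_nonzero(bits == weights, axis=-1)
    n = weights.shape[-1]
    return 2 * matches - n

rng = np.random.default_rng(0)
x = np.array([0.5, -0.25, 0.0, 0.9])          # toy high-precision inputs
w = np.array([1, -1, 1, -1], dtype=np.int8)   # toy binarized weights

streams = stochastic_binarize(x, n_samples=20000, sigma=1.0, rng=rng)
estimate = binary_dot(streams, w).mean()      # average over the bit streams

# Expected value of each bit under normal thresholds with sigma = 1.
expected = sum(int(wi) * math.erf(xi / math.sqrt(2.0)) for wi, xi in zip(w, x))
print(estimate, expected)
```

Longer streams reduce the variance of the estimate at the cost of more binary operations per input, which is the basic accuracy versus latency trade-off in stochastic computing.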

Funder

Science and Engineering Research Board

European Research Council

Publisher

Frontiers Media SA

Subject

General Neuroscience

Cited by 4 articles.

1. Analysis of VMM computation strategies to implement BNN applications on RRAM arrays;APL Machine Learning;2023-04-13

2. A Systematic Literature Review on Binary Neural Networks;IEEE Access;2023

3. Reconfigurable and hardware efficient adaptive quantization model-based accelerator for binarized neural network;Computers and Electrical Engineering;2022-09

4. Time-Multiplexed In-Memory Computation Scheme for Mapping Quantized Neural Networks on Hybrid CMOS-OxRAM Building Blocks;IEEE Transactions on Nanotechnology;2022