Author:
Kaiser Md Abdullah-Al,Datta Gourav,Wang Zixu,Jacob Ajey P.,Beerel Peter A.,Jaiswal Akhilesh R.
Abstract
Edge devices equipped with computer vision must deal with vast amounts of sensory data with limited computing resources. Hence, researchers have been exploring different energy-efficient solutions such as near-sensor, in-sensor, and in-pixel processing, bringing the computation closer to the sensor. In particular, in-pixel processing embeds the computation capabilities inside the pixel array and achieves high energy efficiency by generating low-level features instead of the raw data stream from CMOS image sensors. Many different in-pixel processing techniques and approaches have been demonstrated on conventional frame-based CMOS imagers; however, the processing-in-pixel approach for neuromorphic vision sensors has not been explored so far. In this work, for the first time, we propose an asynchronous non-von-Neumann analog processing-in-pixel paradigm to perform convolution operations by integrating in-situ multi-bit multi-channel convolution inside the pixel array performing analog multiply and accumulate (MAC) operations that consume significantly less energy than their digital MAC alternative. To make this approach viable, we incorporate the circuit's non-ideality, leakage, and process variations into a novel hardware-algorithm co-design framework that leverages extensive HSpice simulations of our proposed circuit using the GF22nm FD-SOI technology node. We verified our framework on state-of-the-art neuromorphic vision sensor datasets and show that our solution consumes ~2× lower backend-processor energy while maintaining almost similar front-end (sensor) energy on the IBM DVS128-Gesture dataset than the state-of-the-art while maintaining a high test accuracy of 88.36%.
Subject
Computer Science Applications,Biomedical Engineering,Neuroscience (miscellaneous)
Reference54 articles.
1. “A low power, fully event-based gesture recognition system,”;Amir;2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR),2017
2. “Towards autonomous driving: a multi-modal 360° perception proposal,”;Beltrán,2020
3. A burst-mode word-serial address-event link-I: Transmitter design;Boahen;IEEE Trans. Circ. Syst. I Regular Pap.,2004
4. “Fully embedding fast convolutional networks on pixel processor arrays,”;Bose,2020
5. In-sensor computing for machine vision;Chai;Nature,2020
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献