BlocTrain: Block-Wise Conditional Training and Inference for Efficient Spike-Based Deep Learning-Reference-Cited by-同舟云学术

BlocTrain: Block-Wise Conditional Training and Inference for Efficient Spike-Based Deep Learning

Published:2021-10-29 Issue: Volume:15 Page:
ISSN:1662-453X
Container-title:Frontiers in Neuroscience
language:
Short-container-title:Front. Neurosci.

Author:

Srinivasan Gopalakrishnan,Roy Kaushik

Abstract

Spiking neural networks (SNNs), with their inherent capability to learn sparse spike-based input representations over time, offer a promising solution for enabling the next generation of intelligent autonomous systems. Nevertheless, end-to-end training of deep SNNs is both compute- and memory-intensive because of the need to backpropagate error gradients through time. We propose BlocTrain, which is a scalable and complexity-aware incremental algorithm for memory-efficient training of deep SNNs. We divide a deep SNN into blocks, where each block consists of few convolutional layers followed by a classifier. We train the blocks sequentially using local errors from the classifier. Once a given block is trained, our algorithm dynamically figures out easy vs. hard classes using the class-wise accuracy, and trains the deeper block only on the hard class inputs. In addition, we also incorporate a hard class detector (HCD) per block that is used during inference to exit early for the easy class inputs and activate the deeper blocks only for the hard class inputs. We trained ResNet-9 SNN divided into three blocks, using BlocTrain, on CIFAR-10 and obtained 86.4% accuracy, which is achieved with up to 2.95× lower memory requirement during the course of training, and 1.89× compute efficiency per inference (due to early exit strategy) with 1.45× memory overhead (primarily due to classifier weights) compared to end-to-end network. We also trained ResNet-11, divided into four blocks, on CIFAR-100 and obtained 58.21% accuracy, which is one of the first reported accuracy for SNN trained entirely with spike-based backpropagation on CIFAR-100.

Publisher

Frontiers Media SA

Subject

General Neuroscience

Reference67 articles.

1. Neural machine translation by jointly learning to align and translate;Bahdanau;arXiv preprint arXiv,2014

2. Greedy layerwise learning can scale to imagenet;Belilovsky,2019

3. Long short-term memory and learning-to-learn in networks of spiking neurons;Bellec,2018

4. Greedy layer-wise training of deep networks;Bengio,2007

5. Benchmarking keyword spotting efficiency on neuromorphic hardware;Blouw,2019

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Exploring Neuromorphic Computing Based on Spiking Neural Networks: Algorithms to Hardware;ACM Computing Surveys;2023-03-02