Enabling Training of Neural Networks on Noisy Hardware-Reference-Cited by-同舟云学术

Enabling Training of Neural Networks on Noisy Hardware

Published:2021-09-09 Issue: Volume:4 Page:
ISSN:2624-8212
Container-title:Frontiers in Artificial Intelligence
language:
Short-container-title:Front. Artif. Intell.

Author:

Gokmen Tayfun

Abstract

Deep neural networks (DNNs) are typically trained using the conventional stochastic gradient descent (SGD) algorithm. However, SGD performs poorly when applied to train networks on non-ideal analog hardware composed of resistive device arrays with non-symmetric conductance modulation characteristics. Recently we proposed a new algorithm, the Tiki-Taka algorithm, that overcomes this stringent symmetry requirement. Here we build on top of Tiki-Taka and describe a more robust algorithm that further relaxes other stringent hardware requirements. This more robust second version of the Tiki-Taka algorithm (referred to as TTv2) 1. decreases the number of device conductance states requirement from 1000s of states to only 10s of states, 2. increases the noise tolerance to the device conductance modulations by about 100x, and 3. increases the noise tolerance to the matrix-vector multiplication performed by the analog arrays by about 10x. Empirical simulation results show that TTv2 can train various neural networks close to their ideal accuracy even at extremely noisy hardware settings. TTv2 achieves these capabilities by complementing the original Tiki-Taka algorithm with lightweight and low computational complexity digital filtering operations performed outside the analog arrays. Therefore, the implementation cost of TTv2 compared to SGD and Tiki-Taka is minimal, and it maintains the usual power and speed benefits of using analog hardware for training workloads. Here we also show how to extract the neural network from the analog hardware once the training is complete for further model deployment. Similar to Bayesian model averaging, we form analog hardware compatible averages over the neural network weights derived from TTv2 iterates. This model average then can be transferred to another analog or digital hardware with notable improvements in test accuracy, transcending the trained model itself. In short, we describe an end-to-end training and model extraction technique for extremely noisy crossbar-based analog hardware that can be used to accelerate DNN training workloads and match the performance of full-precision SGD.

Publisher

Frontiers Media SA

Reference40 articles.

1. Achieving Ideal Accuracies in Analog Neuromorphic Computing Using Periodic Carry;Agarwal,2017

2. Equivalent-accuracy Accelerated Neural-Network Training Using Analogue Memory;Ambrogio;Nature,2018

3. Weight Uncertainty in Neural Networks;Blundell,2015

4. Language Models Are Few-Shot Learners;Brown,2020

Cited by 21 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Fast and robust analog in-memory deep neural network training;Nature Communications;2024-08-20

2. Difficulties and approaches in enabling learning-in-memory using crossbar arrays of memristors;Neuromorphic Computing and Engineering;2024-08-01

3. Gradient-free training of recurrent neural networks using random perturbations;Frontiers in Neuroscience;2024-07-10

4. Retention-aware zero-shifting technique for Tiki-Taka algorithm-based analog deep learning accelerator;Science Advances;2024-06-14

5. The Ouroboros of Memristors: Neural Networks Facilitating Memristor Programming;2024 IEEE 6th International Conference on AI Circuits and Systems (AICAS);2024-04-22