Affiliation:
1. Karlsruhe Institute of Technology, Germany
2. Univ. Grenoble Alpes, CEA, CNRS, Grenoble INP, and IRIG-Spintec, France
3. RevoAI GmbH, Germany
Abstract
Recent development in neural networks (NNs) has led to their widespread use in critical and automated decision-making systems, where uncertainty estimation is essential for trustworthiness. Although conventional NNs can solve many problems accurately, they do not capture the uncertainty of the data or the model during optimization. In contrast, Bayesian neural networks (BNNs), which learn probabilistic distributions for their parameters, offer a sound theoretical framework for estimating uncertainty. However, traditional hardware implementations of BNNs are expensive in terms of computational and memory resources, as they (i) are realized with inefficient von Neumann architectures, (ii) use a significantly large number of random number generators (RNGs) to implement the distributions of BNNs, and (iii) have a substantially greater number of parameters than conventional NNs. Computing-in-memory (CiM) architectures with emerging resistive non-volatile memories (NVMs) are promising candidates for accelerating classical NNs. In particular, spintronic technology, which is distinguished by its low latency and high endurance, aligns very well with these requirements. In the specific context of Bayesian neural networks (BNNs), spintronics technologies are very valuable, thanks to their inherent potential to act as stochastic or as deterministic devices. Consequently, BNNs mapped on spintronic-based CiM architectures could be a highly efficient implementation strategy. However, the direct implementation on CiM hardware of the learned probabilistic distributions of BNN may not be feasible and can incur high overhead. In this work, we propose a new Bayesian neural network topology, named
SpinBayes
, that is able to perform efficient sampling during the Bayesian inference process. Moreover, a Bayesian approximation method, called
in-memory approximation
, is proposed that approximates the original probabilistic distributions of BNN with a distribution that can be efficiently mapped to spintronic-based CiM architectures. Compared to state-of-the-art methods, the memory overhead is reduced by 8× and the energy consumption by 80×. Our method has been evaluated on several classification and semantic segmentation tasks and can detect up to 100% of various types of out-of-distribution data, highlighting the robustness of our approach, without any performance sacrifice.
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture,Software
Reference41 articles.
1. Soyed Tuhin Ahmed, Kamal Danouchi, Christopher Münch, Guillaume Prenat, Lorena Anghel, and Mehdi B. Tahoori. 2022. Binary bayesian neural networks for efficient uncertainty estimation leveraging inherent stochasticity of spintronic devices. In IEEE/ACM NANOARCH.
2. SpinDrop: Dropout-Based Bayesian Binary Neural Networks With Spintronic Implementation
3. Compute-in-Memory Technologies and Architectures for Deep Learning Workloads
4. Concrete problems in AI safety;Amodei Dario;arXiv preprint arXiv:1606.06565,2016
5. Hiromitsu Awano et al. 2020. BYNQNet: Bayesian neural network with quadratic activations for sampling-free uncertainty estimation on FPGA. In DATE.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献