SalvageDNN: salvaging deep neural network accelerators with permanent faults through saliency-driven fault-aware mapping

Published:2019-12-23 Issue:2164 Volume:378 Page:20190164
ISSN:1364-503X
Container-title:Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences
language:en
Short-container-title:Phil. Trans. R. Soc. A.

Author:

Abdullah Hanif Muhammad¹, Shafique Muhammad¹

Affiliation:

1. Technische Universität Wien (TU Wien), Vienna, Austria

Abstract

Deep neural networks (DNNs) have proliferated in most of the application domains that involve data processing, predictive analysis and knowledge inference. Alongside the need for developing highly performance-efficient DNN accelerators, there is an utmost need to improve the yield of the manufacturing process in order to reduce the per unit cost of the DNN accelerators. To this end, we present ‘SalvageDNN’, a methodology to enable reliable execution of DNNs on the hardware accelerators with permanent faults (typically due to imperfect manufacturing processes). It employs a fault-aware mapping of different parts of a given DNN on the hardware accelerator (subjected to faults) by leveraging the saliency of the DNN parameters and the fault map of the underlying processing hardware. We also present novel modifications in a systolic array design to further improve the yield of the accelerators while ensuring reliable DNN execution using ‘SalvageDNN’ and negligible overheads in terms of area, power/energy and performance. This article is part of the theme issue ‘Harmonizing energy-autonomous computing and intelligence’.
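The fault-aware mapping idea in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm. It assumes that each filter of a layer occupies one column of a systolic array, that a weight placed on a faulty processing element contributes nothing to the output, and that a per-weight saliency score is already available. The names `lost_saliency` and `fault_aware_mapping`, and the integer saliency values, are hypothetical and chosen only for illustration.

```python
from itertools import permutations


def lost_saliency(saliency, fault_map, order):
    """Total saliency of the weights that land on faulty PEs.

    saliency[r][f]  : saliency of weight r of filter f (assumed given)
    fault_map[r][c] : True if the PE at row r, column c is faulty
    order[c]        : index of the filter mapped to array column c
    """
    return sum(
        saliency[r][order[c]]
        for c in range(len(order))
        for r in range(len(fault_map))
        if fault_map[r][c]
    )


def fault_aware_mapping(saliency, fault_map):
    """Pick the filter-to-column assignment that loses the least saliency.

    Exhaustive search over all permutations: fine for a toy array,
    factorial in the number of columns, so not usable at real scale.
    """
    n_cols = len(fault_map[0])
    return min(
        permutations(range(n_cols)),
        key=lambda order: lost_saliency(saliency, fault_map, order),
    )


# Toy 3x3 array: column 0 has faulty PEs in rows 0 and 2.
saliency = [
    [9, 1, 5],
    [8, 2, 4],
    [7, 3, 6],
]
fault_map = [
    [True, False, False],
    [False, False, False],
    [True, False, False],
]

naive_order = (0, 1, 2)  # filter i on column i, ignoring the fault map
best_order = fault_aware_mapping(saliency, fault_map)

print(lost_saliency(saliency, fault_map, naive_order))
print(best_order, lost_saliency(saliency, fault_map, best_order))
```

In this toy case the search places the least salient filter on the faulty column, so the saliency lost to faults is lower than with the naive mapping. A practical version would replace the exhaustive search with an assignment solver or a greedy heuristic.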

Funder

Deutsche Forschungsgemeinschaft

Publisher

The Royal Society

Subject

General Physics and Astronomy, General Engineering, General Mathematics

Link

https://royalsocietypublishing.org/doi/pdf/10.1098/rsta.2019.0164

References: 29 articles.

1. Deep learning

2. Efficient Processing of Deep Neural Networks: A Tutorial and Survey

3. Jouppi NP et al. 2017 In-datacenter performance analysis of a tensor processing unit. In 2017 ACM/IEEE 44th Annual Int. Symp. Computer Architecture (ISCA), Toronto, ON, Canada, 24–28 June 2017, pp. 1–12. New York, NY: ACM.

4. Hanif MA, Putra RVW, Tanvir M, Hafiz R, Rehman S, Shafique M. 2018 MPNA: a massively-parallel neural array accelerator with dataflow optimization for convolutional neural networks. (http://arxiv.org/abs/1810.12910)

5. Eyeriss v2: A Flexible Accelerator for Emerging Deep Neural Networks on Mobile Devices

Cited by 32 articles.

1. Cost-Effective Fault Tolerance for CNNs Using Parameter Vulnerability Based Hardening and Pruning;2024 IEEE 30th International Symposium on On-Line Testing and Robust System Design (IOLTS);2024-07-03

2. Design Exploration of Fault-Tolerant Deep Neural Networks Using Posit Number Representation System;IEEE Transactions on Very Large Scale Integration (VLSI) Systems;2024-07

3. Evaluating the Reliability of Supervised Compression for Split Computing;2024 IEEE 42nd VLSI Test Symposium (VTS);2024-04-22

4. A Systematic Literature Review on Hardware Reliability Assessment Methods for Deep Neural Networks;ACM Computing Surveys;2024-01-22

5. A Quality-Aware Voltage Overscaling Framework to Improve the Energy Efficiency and Lifetime of TPUs Based on Statistical Error Modeling;IEEE Access;2024