Affiliation:
1. Computer Science and Engineering, University of California San Diego, La Jolla, United States
2. Fermi National Accelerator Laboratory, Batavia, United States
3. Physics, University of California San Diego, La Jolla, United States
Abstract
Edge computation often requires robustness to faults, e.g., to reduce the effects of transient errors and to function correctly in high radiation environments. In these cases, the edge device must be designed with fault tolerance as a primary objective.
FKeras
is a tool that helps design fault-tolerant edge neural networks (NNs) that run entirely on chip to meet strict latency and resource requirements.
FKeras
provides metrics that give a bit-level ranking of NN weights with respect to their sensitivity to faults.
FKeras
includes these sensitivity metrics to guide efficient fault injection campaigns to help evaluate the robustness of an NN architecture. We show how to use
FKeras
in the codesign of edge NNs trained on the high-granularity endcap calorimeter dataset, which represents high energy physics data, as well as the CIFAR-10 dataset. We use
FKeras
to analyze an NN’s fault tolerance to consider alongside its accuracy, performance, and resource consumption. The results show that the different NN architectures have vastly differing resilience to faults.
FKeras
can also determine how to protect NN weights best, e.g., by selectively using triple modular redundancy on only the most sensitive weights, which reduces area without affecting accuracy.
Funder
National Science Foundation Graduate Research Fellowship Program
U.S. Department of Energy (DOE), Office of Science, Office of Advanced Scientific Computing Research
DOE, Office of Science, Office of High Energy Physics Early Career Research
U.S. National Science Foundation Harnessing the Data Revolution (HDR) Institute for Accelerating AI Algorithms for Data Driven Discovery
DOE Early Career Research
Publisher
Association for Computing Machinery (ACM)
Reference78 articles.
1. Artificial neural networks in medical diagnosis;Al-Shayea Qeethara Kadhim;International Journal of Computer Science Issues,2011
2. LHC and HL-LHC: Present and future radiation environment in the high-luminosity collision points and RHA implications;IEEE Transactions on Nuclear Science,2017
3. Medical Image Analysis using Convolutional Neural Networks: A Review
4. Understanding the propagation of transient errors in HPC applications
5. Randomized algorithms for estimating the trace of an implicit symmetric positive semi-definite matrix
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献