Affiliation:
1. Academia Sinica, Taiwan
2. Academia Sinica and National Taiwan University, Taiwan
3. Academia Sinica, National Taiwan University and National Chi Nan University, Taiwan
Abstract
The increasing paradigm shift towards
i
ntermittent computing has made it possible to intermittently execute
d
eep neural network (DNN) inference on edge devices powered by ambient energy. Recently,
n
eural architecture search (NAS) techniques have achieved great success in automatically finding DNNs with high accuracy and low inference latency on the deployed hardware. We make a key observation, where NAS attempts to improve inference latency by primarily maximizing data reuse, but the derived solutions when deployed on intermittently-powered systems may be inefficient, such that the inference may not satisfy an end-to-end latency requirement and, more seriously, they may be unsafe given an insufficient energy budget. This work proposes iNAS, which introduces intermittent execution behavior into NAS to find accurate network architectures with corresponding execution designs, which can safely and efficiently execute under intermittent power. An intermittent-aware execution design explorer is presented, which finds the right balance between data reuse and the costs related to intermittent inference, and incorporates a preservation design search space into NAS, while ensuring the power-cycle energy budget is not exceeded. To assess an intermittent execution design, an intermittent-aware abstract performance model is presented, which formulates the key costs related to progress preservation and recovery during intermittent inference. We implement iNAS on top of an existing NAS framework and evaluate their respective solutions found for various datasets, energy budgets and latency requirements, on a Texas Instruments device. Compared to those NAS solutions that can safely complete the inference, the iNAS solutions reduce the intermittent inference latency by 60% on average while achieving comparable accuracy, with an average 7% increase in search overhead.
Funder
Ministry of Science and Technology, Taiwan
Publisher
Association for Computing Machinery (ACM)
Subject
Hardware and Architecture,Software
Reference69 articles.
1. Demystifying Energy Consumption Dynamics in Transiently powered Computers
2. ARM. 2021. Cortex M4 DSP ISA. https://developer.arm.com/architectures/instruction-sets/dsp-extensions/dsp-for-cortex-m. ARM. 2021. Cortex M4 DSP ISA. https://developer.arm.com/architectures/instruction-sets/dsp-extensions/dsp-for-cortex-m.
Cited by
25 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献