Double-Shift: A Low-Power DNN Weights Storage and Access Framework based on Approximate Decomposition and Quantization

Published: 2022-03-31  Issue: 2  Volume: 27  Pages: 1-16
ISSN: 1084-4309
Container title: ACM Transactions on Design Automation of Electronic Systems
Language: en
Short container title: ACM Trans. Des. Autom. Electron. Syst.

Author:

Han Ming¹, Wang Ye¹, Dong Jian¹, Qu Gang²

Affiliation:

1. School of Computer Science and Technology, Harbin Institute of Technology

2. Electrical and Computer Engineering Department and Institute for System Research, University of Maryland, College Park, USA

Abstract

One major challenge in deploying Deep Neural Networks (DNNs) in resource-constrained applications, such as edge nodes, mobile embedded systems, and IoT devices, is their high energy cost. The emerging approximate computing methodology can effectively reduce the energy consumed by computation in DNNs. However, a recent study shows that weight storage and access operations can dominate a DNN's energy consumption, because the very large set of DNN weights must be stored in high-energy-cost DRAM. In this paper, we propose Double-Shift, a low-power DNN weight storage and access framework, to solve this problem. Enabled by approximate decomposition and quantization, Double-Shift effectively reduces the data size of the weights. Through a novel weight storage allocation strategy, Double-Shift boosts energy efficiency by trading energy-consuming weight storage and access operations for low-energy-cost computations. Our experimental results show that Double-Shift can reduce DNN weights to 3.96%–6.38% of their original size and achieve energy savings of 86.47%–93.62%, while introducing a DNN classification error within 2%.
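
The abstract does not spell out the quantization scheme, so the following is only a minimal sketch of one common shift-friendly approach: rounding each weight to a signed power of two so that a multiplication becomes a bit shift. It is not the authors' implementation. The function names `quantize_pow2`, `shift_multiply`, and `storage_ratio`, the exponent range, and the 32-bit baseline are all assumptions made for illustration.

```python
import math

# Assumed exponent range: 7 exponent levels (-6..0) plus one reserved
# code for zero gives 8 codes, i.e. 3 bits, plus 1 sign bit per weight.
MIN_EXP, MAX_EXP = -6, 0


def quantize_pow2(w):
    """Round a weight to the nearest signed power of two (in the log domain).

    Returns (sign, exponent); sign is 0 for a zero weight.
    """
    if w == 0.0:
        return 0, MIN_EXP
    sign = 1 if w > 0 else -1
    exp = round(math.log2(abs(w)))
    exp = max(MIN_EXP, min(MAX_EXP, exp))  # clamp to the representable range
    return sign, exp


def shift_multiply(x, sign, exp):
    """Multiply an integer activation x by sign * 2**exp using shifts only."""
    if sign == 0:
        return 0
    shifted = x << exp if exp >= 0 else x >> -exp
    return sign * shifted


def storage_ratio(bits_per_weight=4, baseline_bits=32):
    """Quantized storage size as a fraction of the assumed 32-bit baseline."""
    return bits_per_weight / baseline_bits


sign, exp = quantize_pow2(0.3)          # 0.3 is rounded to 2**-2 = 0.25
product = shift_multiply(64, sign, exp)  # 64 * 0.25 computed as 64 >> 2
```

In this toy setting each weight needs 4 bits instead of 32, so `storage_ratio()` gives 0.125. The 3.96%–6.38% range reported in the abstract is smaller than that, which is consistent with the framework combining quantization with approximate decomposition rather than using quantization alone.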

Publisher

Association for Computing Machinery (ACM)

Subject

Electrical and Electronic Engineering, Computer Graphics and Computer-Aided Design, Computer Science Applications

Link

https://dl.acm.org/doi/pdf/10.1145/3477047

References: 21 articles.

1. SmartExchange: Trading Higher-cost Memory Storage/Access for Lower-cost Computation

2. Write Buffer-Oriented Energy Reduction in the L1 Data Cache for Embedded Systems