Energy Complexity of Convolutional Neural Networks

Author:

Šíma Jiří1,Vidnerová Petra2,Mrázek Vojtěch3

Affiliation:

1. Institute of Computer Science of the Czech Academy of Sciences, 182 00 Prague 8, Czechia sima@cs.cas.cz

2. Institute of Computer Science of the Czech Academy of Sciences, 182 00 Prague 8, Czechia petra@cs.cas.cz

3. Faculty of Information Technology, Brno University of Technology, 612 00 Brno, Czechia mrazek@fit.vutbr.cz

Abstract

Abstract The energy efficiency of hardware implementations of convolutional neural networks (CNNs) is critical to their widespread deployment in low-power mobile devices. Recently, a number of methods have been proposed for providing energy-optimal mappings of CNNs onto diverse hardware accelerators. Their estimated energy consumption is related to specific implementation details and hardware parameters, which does not allow for machine-independent exploration of CNN energy measures. In this letter, we introduce a simplified theoretical energy complexity model for CNNs, based on only a two-level memory hierarchy that captures asymptotically all important sources of energy consumption for different CNN hardware implementations. In this model, we derive a simple energy lower bound and calculate the energy complexity of evaluating a CNN layer for two common data flows, providing corresponding upper bounds. According to statistical tests, the theoretical energy upper and lower bounds we present fit asymptotically very well with the real energy consumption of CNN implementations on the Simba and Eyeriss hardware platforms, estimated by the Timeloop/Accelergy program, which validates the proposed energy complexity model for CNNs.

Publisher

MIT Press

Reference20 articles.

1. Fused-layer CNN accelerators;Alwani,2016

2. Improving the accuracy and hardware efficiency of neural networks using approximate multipliers;Ansari;IEEE Transactions on Very Large Scale Integration Systems,2020

3. Hardware approximate techniques for deep neural network accelerators: A survey;Armeniakos;ACM Computing Surveys,2023

4. Eyeriss: A spatial architecture for energy-efficient dataflow for convolutional neural networks;Chen,2016

5. Deep learning with limited numerical precision;Gupta;Proceedings of the ICML 2015 32nd International Conference on Machine Learning,2015

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3