Towards Ultra-High Performance and Energy Efficiency of Deep Learning Systems: An Algorithm-Hardware Co-Optimization Framework

Published:2018-04-29 Issue:1 Volume:32
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title:AAAI

Author:

Wang Yanzhi,Ding Caiwen,Li Zhe,Yuan Geng,Liao Siyu,Ma Xiaolong,Yuan Bo,Qian Xuehai,Tang Jian,Qiu Qinru,Lin Xue

Abstract

Hardware accelerations of deep learning systems have been extensively investigated in industry and academia. The aim of this paper is to achieve ultra-high energy efficiency and performance for hardware implementations of deep neural networks (DNNs). An algorithm-hardware co-optimization framework is developed, which is applicable to different DNN types, sizes, and application scenarios. The algorithm part adopts the general block-circulant matrices to achieve a fine-grained tradeoff of accuracy and compression ratio. It applies to both fully-connected and convolutional layers and contains a mathematically rigorous proof of the effectiveness of the method. The proposed algorithm reduces computational complexity per layer from O(n^2) to O(n log n) and storage complexity from O(n^2) to O(n), both for training and inference. The hardware part consists of highly efficient Field Programmable Gate Array (FPGA)-based implementations using effective reconfiguration, batch processing, deep pipelining, resource re-using, and hierarchical control. Experimental results demonstrate that the proposed framework achieves at least 152X speedup and 71X energy efficiency gain compared with IBM TrueNorth processor under the same test accuracy. It achieves at least 31X energy efficiency gain compared with the reference FPGA-based work.
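
The complexity reduction quoted in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes each circulant block is defined by its first column, so that C[i][j] = c[(i - j) mod n], and it relies on the standard identity that a circulant matrix-vector product is a circular convolution, which can be computed as IFFT(FFT(c) * FFT(x)). All function names below are hypothetical, and the radix-2 FFT requires a power-of-two block size.

```python
import cmath


def fft(x, inverse=False):
    """Recursive radix-2 FFT. len(x) must be a power of two.
    The inverse transform is returned unnormalized (caller divides by n)."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2], inverse)
    odd = fft(x[1::2], inverse)
    sign = 1 if inverse else -1
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def circulant_matvec_dense(c, x):
    """Reference O(n^2) product with C[i][j] = c[(i - j) mod n]."""
    n = len(c)
    return [sum(c[(i - j) % n] * x[j] for j in range(n)) for i in range(n)]


def circulant_matvec_fft(c, x):
    """Same product in O(n log n): circular convolution via the FFT.
    Only the n entries of c are stored, not the full n x n matrix."""
    n = len(c)
    spectrum = [a * b for a, b in zip(fft(c), fft(x))]
    return [v.real / n for v in fft(spectrum, inverse=True)]


def block_circulant_matvec(blocks, x, b):
    """blocks[i][j] holds the first column (length b) of circulant block (i, j).
    Assumed layout for illustration only."""
    out = []
    for row in blocks:
        acc = [0.0] * b
        for j, c in enumerate(row):
            part = circulant_matvec_fft(c, x[j * b:(j + 1) * b])
            acc = [a + p for a, p in zip(acc, part)]
        out.extend(acc)
    return out
```

In this sketch the block size b is the knob behind the accuracy versus compression tradeoff the abstract mentions: each b x b block stores b values instead of b^2, so larger blocks compress more.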

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 4 articles.

1. Stealthy Energy Consumption-oriented Attacks on Training Stage in Deep Learning;Journal of Signal Processing Systems;2023-10-11

2. Meta-Scheduling Framework With Cooperative Learning Toward Beyond 5G;IEEE Journal on Selected Areas in Communications;2023-06

3. Radio and Energy Resource Management in Renewable Energy-Powered Wireless Networks With Deep Reinforcement Learning;IEEE Transactions on Wireless Communications;2022-07

4. A Survey of FPGA-Based Vision Systems for Autonomous Cars;IEEE Access;2022