Author:
Yang Dawei, Li Xinlei, Qi Lizhe, Zhang Wenqiang, Jiang Zhe
Abstract
Deep neural networks (DNNs) are widely used in many artificial intelligence applications, and many specialized DNN-inference accelerators have been proposed. However, existing DNN accelerators rely heavily on specific types of DNN operations (such as Conv, FC, and ReLU), which may become less used or outdated in the future, posing flexibility and compatibility challenges for existing work. This paper designs a flexible DNN accelerator from a more generic perspective rather than speeding up specific types of DNN operations. Our proposed Nebula exploits the width property of DNNs and gains a significant improvement in system throughput and energy efficiency over multi-branch architectures. Nebula is a first-of-its-kind framework for multi-branch DNNs.
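For readers unfamiliar with the term, the sketch below shows a generic multi-branch block in PyTorch, in the style of Inception-like networks: several parallel branches widen the layer before their outputs are concatenated. This is only an illustrative assumption about the general class of multi-branch DNNs the abstract refers to, not Nebula's architecture or the paper's implementation.

```python
# Illustrative only: a minimal multi-branch block (Inception-style).
# It shows the structure of multi-branch DNNs in general, not Nebula's design.
import torch
import torch.nn as nn


class MultiBranchBlock(nn.Module):
    def __init__(self, in_channels: int, branch_channels: int = 16):
        super().__init__()
        # Three parallel branches with different receptive fields.
        self.branch1 = nn.Conv2d(in_channels, branch_channels, kernel_size=1)
        self.branch3 = nn.Conv2d(in_channels, branch_channels, kernel_size=3, padding=1)
        self.branch5 = nn.Conv2d(in_channels, branch_channels, kernel_size=5, padding=2)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Branch outputs are computed independently (and could, in principle,
        # be scheduled in parallel on a wide accelerator), then concatenated
        # along the channel dimension.
        return torch.cat([self.branch1(x), self.branch3(x), self.branch5(x)], dim=1)


if __name__ == "__main__":
    block = MultiBranchBlock(in_channels=3)
    y = block(torch.randn(1, 3, 32, 32))
    print(y.shape)  # torch.Size([1, 48, 32, 32])
```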
Subject
Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering
Cited by
2 articles.