Design of a Generic Dynamically Reconfigurable Convolutional Neural Network Accelerator with Optimal Balance
-
Published:2024-02-14
Issue:4
Volume:13
Page:761
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Tong Haoran1, Han Ke1ORCID, Han Si2ORCID, Luo Yingqi1
Affiliation:
1. School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China 2. School of Information Management for Laws, China University for Political Science and Law, Beijing 100091, China
Abstract
In many scenarios, edge devices perform computations for applications such as target detection and tracking, multimodal sensor fusion, low-light image enhancement, and image segmentation. There is an increasing trend of deploying and running multiple different network models on one hardware platform, but there is a lack of generic acceleration architectures that support standard convolution (CONV), depthwise separable CONV, and deconvolution (DeCONV) layers in such complex scenarios. In response, this paper proposes a more versatile dynamically reconfigurable CNN accelerator with a highly unified computing scheme. The proposed design, which is compatible with standard CNNs, lightweight CNNs, and CNNs with DeCONV layers, further improves the resource utilization and reduces the gap of efficiency when deploying different models. Thus, the hardware balance during the alternating execution of multiple models is enhanced. Compared to a state-of-the-art CNN accelerator, Xilinx DPU B4096, our optimized architecture achieves resource utilization improvements of 1.08× for VGG16 and 1.77× for MobileNetV1 in inference tasks on the Xilinx ZCU102 platform. The resource utilization and efficiency degradation between these two models are reduced to 59.6% and 63.7%, respectively. Furthermore, the proposed architecture can properly run DeCONV layers and demonstrates good performance.
Funder
Fundamental Research Funds for the Central Universities
Reference30 articles.
1. Self-Driving Cars: A Survey;Badue;Expert Syst. Appl.,2021 2. Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., and Adam, H. (2017). MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv. 3. Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., and Chen, L.-C. (2018, January 18–22). Mobilenetv2: Inverted Residuals and Linear Bottlenecks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA. 4. Network Deconvolution as a General Method to Distinguish Direct Dependencies in Networks;Feizi;Nat. Biotechnol.,2013 5. Noh, H., Hong, S., and Han, B. (2015, January 7–13). Learning Deconvolution Network for Semantic Segmentation. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.
|
|