Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing-Reference-Cited by-同舟云学术

Simplification of Deep Neural Network-Based Object Detector for Real-Time Edge Computing

Published:2023-04-06 Issue:7 Volume:23 Page:3777
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Choi Kyoungtaek¹^ORCID,Wi Seong Min²,Jung Ho Gi³^ORCID,Suhr Jae Kyu⁴^ORCID

Affiliation:

1. Department of AI Automation Robot, Daegu Catholic University, 13-13 Hayang-ro, Hayang-eup, Gyeongsan-si 38430, Gyeongsangbuk-do, Republic of Korea

2. Driving Image Recognition Logic Cell, Hyundai Mobis, 17-2 Mabuk-ro 240beon-gil, Giheung-gu, Yongin-si 16891, Gyeonggi-do, Republic of Korea

3. Department of Electronic Engineering, Korea National University of Transportation, 50 Daehak-ro, Chungju-si 27469, Chungbuk-do, Republic of Korea

4. Department of Intelligent Mechatronics Engineering, Sejong University, 209 Neungdong-ro, Gwangjin-gu, Seoul 05006, Republic of Korea

Abstract

This paper presents a method for simplifying and quantizing a deep neural network (DNN)-based object detector to embed it into a real-time edge device. For network simplification, this paper compares five methods for applying channel pruning to a residual block because special care must be taken regarding the number of channels when summing two feature maps. Based on the comparison in terms of detection performance, parameter number, computational complexity, and processing time, this paper discovers the most satisfying method on the edge device. For network quantization, this paper compares post-training quantization (PTQ) and quantization-aware training (QAT) using two datasets with different detection difficulties. This comparison shows that both approaches are recommended in the case of the easy-to-detect dataset, but QAT is preferable in the case of the difficult-to-detect dataset. Through experiments, this paper shows that the proposed method can effectively embed the DNN-based object detector into an edge device equipped with Qualcomm’s QCS605 System-on-Chip (SoC), while achieving a real-time operation with more than 10 frames per second.

Funder

National Research Foundation of Korea

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/7/3777/pdf

Reference70 articles.

1. Ghimire, D., Kil, D., and Kim, S.H. (2022). A Survey on Efficient Convolutional Neural Networks and Hardware Acceleration. Electronics, 11.

2. Neill, J.O. (2020). An Overview of Neural Network Compression. arXiv.

3. Mishra, R., Gupta, H.P., and Dutta, T. (2020). A Survey on Deep Neural Network Compression: Challenges, Overview, and Solutions. arXiv.