A Novel Lightweight Object Detection Network with Attention Modules and Hierarchical Feature Pyramid
Author:
Yang Shengying (1, 2), Chen Linfeng (2), Wang Junxia (2), Jin Wuyin (1), Yu Yunxiang (3)
Affiliation:
1. School of Mechanical and Electrical Engineering, Lanzhou University of Technology, Lanzhou 730050, China
2. School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Hangzhou 310023, China
3. Zhejiang Dingli Industry Co., Ltd., Lishui 321400, China
Abstract
Object detection methods based on deep learning typically require devices with ample computing capabilities, which limits their deployment in resource-constrained environments such as embedded devices. To address this challenge, we propose Mini-YOLOv4, a lightweight real-time object detection network that achieves an excellent trade-off between speed and accuracy. Using CSPDarknet-Tiny as the backbone network, we enhance the detection performance of the network in three ways. First, we use a multibranch structure embedded in an attention module for simultaneous spatial and channel attention calibration. Second, we design a group self-attention block with a symmetric structure consisting of a pair of complementary self-attention modules to mine contextual information, thereby improving detection accuracy without increasing the computational cost. Finally, we introduce a hierarchical feature pyramid network to fully exploit multiscale feature maps and promote the extraction of fine-grained features. The experimental results demonstrate that Mini-YOLOv4 requires only 4.7 M parameters and 3.1 billion floating-point operations (BFLOPs). Compared with YOLOv4-Tiny, our approach achieves a 3.2% improvement in mean average precision (mAP) on the PASCAL VOC dataset and a 3.5% improvement in overall detection accuracy on the MS COCO dataset. When tested on an embedded platform, Mini-YOLOv4 achieves a real-time detection speed of 25.6 FPS on the NVIDIA Jetson Nano, thus meeting the demand for real-time detection on computationally limited devices.
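To put the reported figures in context, the following minimal sketch converts the 25.6 FPS and 3.1 BFLOPs values from the abstract into a per-frame time budget and a sustained operation rate. It assumes that the BFLOPs value refers to a single forward pass on one image, which the abstract does not state explicitly; the function names are illustrative only.

```python
def frame_budget_ms(fps: float) -> float:
    """Average time available per frame, in milliseconds."""
    return 1000.0 / fps


def sustained_bflops_per_second(bflops_per_frame: float, fps: float) -> float:
    """Billions of floating-point operations executed per second."""
    return bflops_per_frame * fps


FPS_JETSON_NANO = 25.6   # reported in the abstract
BFLOPS_PER_FRAME = 3.1   # reported in the abstract; assumed to be per forward pass

print(f"{frame_budget_ms(FPS_JETSON_NANO):.1f} ms per frame")
print(f"{sustained_bflops_per_second(BFLOPS_PER_FRAME, FPS_JETSON_NANO):.1f} "
      "billion operations per second")
```

Under that assumption, the network has roughly 39 ms to process each frame, which corresponds to about 79 billion floating-point operations per second sustained on the device.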
Funder
National Natural Science Foundation of China; Scientific Research Fund of Zhejiang Provincial Education Department
Subject
Physics and Astronomy (miscellaneous), General Mathematics, Chemistry (miscellaneous), Computer Science (miscellaneous)