A Novel Lightweight Object Detection Network with Attention Modules and Hierarchical Feature Pyramid-Reference-Cited by-同舟云学术

A Novel Lightweight Object Detection Network with Attention Modules and Hierarchical Feature Pyramid

Published:2023-11-17 Issue:11 Volume:15 Page:2080
ISSN:2073-8994
Container-title:Symmetry
language:en
Short-container-title:Symmetry

Author:

Yang Shengying¹²^ORCID,Chen Linfeng²,Wang Junxia²,Jin Wuyin¹,Yu Yunxiang³

Affiliation:

1. School of Mechanical and Electrical Engineering, Lanzhou University of Technology, Lanzhou 730050, China

2. School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Hangzhou 310023, China

3. Zhejiang Dingli Industry Co., Ltd., Lishui 321400, China

Abstract

Object detection methods based on deep learning typically require devices with ample computing capabilities, which limits their deployment in restricted environments such as those with embedded devices. To address this challenge, we propose Mini-YOLOv4, a lightweight real-time object detection network that achieves an excellent trade-off between speed and accuracy. Based on CSPDarknet-Tiny as the backbone network, we enhance the detection performance of the network in three ways. We use a multibranch structure embedded in an attention module for simultaneous spatial and channel attention calibration. We design a group self-attention block with a symmetric structure consisting of a pair of complementary self-attention modules to mine contextual information, thereby ensuring that the detection accuracy is improved without increasing the computational cost. Finally, we introduce a hierarchical feature pyramid network to fully exploit multiscale feature maps and promote the extraction of fine-grained features. The experimental results demonstrate that Mini-YOLOv4 requires only 4.7 M parameters and has a billion floating point operations (BFLOPs) value of 3.1. Compared with YOLOv4-Tiny, our approach achieves a 3.2% improvement in mean accuracy precision (mAP) for the PASCAL VOC dataset and obtains a significant improvement of 3.5% in overall detection accuracy for the MS COCO dataset. In testing with an embedded platform, Mini-YOLOv4 achieves a real-time detection speed of 25.6 FPS on the NVIDIA Jetson Nano, thus meeting the demand for real-time detection in computationally limited devices.

Funder

National Natural Science Foundation of China

Scientific Research Fund of Zhejiang Provincial Education Department

Publisher

MDPI AG

Subject

Physics and Astronomy (miscellaneous),General Mathematics,Chemistry (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2073-8994/15/11/2080/pdf

Reference66 articles.

1. Chen, R., Liu, Y., Zhang, M., Liu, S., Yu, B., and Tai, Y.-W. (2020, January 23–28). Dive deeper into box for object detection. Proceedings of the 2020 European Conference on Computer Vision (ECCV): 16th European Conference, Glasgow, UK.

2. Wu, Y., Chen, Y., Yuan, L., Liu, Z., Wang, L., Li, H., and Fu, Y. (2020, January 14–19). Rethinking classification and localization for object detection. Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.

3. Qiu, H., Li, H., Wu, Q., and Shi, H. (2020, January 14–19). Offset bin classification network for accurate object detection. Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.

4. Shi, H., Zhou, Q., Ni, Y., Wu, X., and Latecki, L.J. (2022, January 16–19). DPNET: Dual-path network for efficient object detection with Lightweight Self-Attention. Proceedings of the 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France.

5. EEEA-Net: An early exit evolutionary neural architecture search;Termritthikun;Eng. Appl. Artif. Intell.,2021