A Lightweight YOLO Object Detection Algorithm Based on Bidirectional Multi‐Scale Feature Enhancement
-
Published:2024-03-12
Issue:5
Volume:7
Page:
-
ISSN:2513-0390
-
Container-title:Advanced Theory and Simulations
-
language:en
-
Short-container-title:Advcd Theory and Sims
Author:
Liu Qunpo12,
Zhang Jingwen1ORCID,
Zhang Zhuoran1,
Bu Xuhui12,
Hanajima Naohiko23
Affiliation:
1. School of Electrical Engineering and Automation Henan Polytechnic University Henan 454000 China
2. International Joint Laboratory of Direct Drive and Control Henan of Intelligent Equipment Henan 454000 China
3. College of Information and Systems Muroran Institute of Technology Hokkaido 050–8585 Japan
Abstract
AbstractThis paper proposes a lightweight YOLO object detection algorithm based on bidirectional multi‐scale feature enhancement. The problem is that the original YOLOv5 algorithm does not make full use of the relationship between the feature layers, resulting in the loss of target semantic information and a large number of parameters. First, a bidirectional multi‐scale feature‐enhanced weighted fusion backbone network is constructed to extract target features repeatedly. It enhances the fusion ability of shallow detail features and high‐level semantic information to capture richer multi‐scale semantic information. Second, the NCA attention module is built and integrated into the feature fusion network to enhance the critical characteristics of the target region. Finally, the Ghost module is used instead of the convolutional blocks in the original network to lighten the model while reducing the network complexity and training difficulty. Experimental results show that the improved YOLOv5 algorithm achieves 78.8% mAP@0.5 for the PASCAL VOC2012 dataset, which is 1.5% higher than the original algorithm, at 62.5 FPS. The number of parameters is also reduced by 43.6%. The mAP@0.5 on the self‐made metal foreign object dataset reached 98.4%, at 58.8 FPS, which can meet the requirements of end‐device deployment and real‐time detection.
Funder
Henan Provincial Science and Technology Research Project
Science and Technology Innovation Talents in Universities of Henan Province
National Natural Science Foundation of China
Reference36 articles.
1. R.Girshick J.Donahue T.Darrell J.Malik Proceed. IEEE/CVF Conf. on Comp. Vision Pattern Recog. 2014 580.
2. R.Girshick Proceed. IEEE Int. Conf. Comp. Vision 2015 1440.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献