A Lightweight YOLO Object Detection Algorithm Based on Bidirectional Multi‐Scale Feature Enhancement

Author:

Liu Qunpo12,Zhang Jingwen1ORCID,Zhang Zhuoran1,Bu Xuhui12,Hanajima Naohiko23

Affiliation:

1. School of Electrical Engineering and Automation Henan Polytechnic University Henan 454000 China

2. International Joint Laboratory of Direct Drive and Control Henan of Intelligent Equipment Henan 454000 China

3. College of Information and Systems Muroran Institute of Technology Hokkaido 050–8585 Japan

Abstract

AbstractThis paper proposes a lightweight YOLO object detection algorithm based on bidirectional multi‐scale feature enhancement. The problem is that the original YOLOv5 algorithm does not make full use of the relationship between the feature layers, resulting in the loss of target semantic information and a large number of parameters. First, a bidirectional multi‐scale feature‐enhanced weighted fusion backbone network is constructed to extract target features repeatedly. It enhances the fusion ability of shallow detail features and high‐level semantic information to capture richer multi‐scale semantic information. Second, the NCA attention module is built and integrated into the feature fusion network to enhance the critical characteristics of the target region. Finally, the Ghost module is used instead of the convolutional blocks in the original network to lighten the model while reducing the network complexity and training difficulty. Experimental results show that the improved YOLOv5 algorithm achieves 78.8% mAP@0.5 for the PASCAL VOC2012 dataset, which is 1.5% higher than the original algorithm, at 62.5 FPS. The number of parameters is also reduced by 43.6%. The mAP@0.5 on the self‐made metal foreign object dataset reached 98.4%, at 58.8 FPS, which can meet the requirements of end‐device deployment and real‐time detection.

Funder

Henan Provincial Science and Technology Research Project

Science and Technology Innovation Talents in Universities of Henan Province

National Natural Science Foundation of China

Publisher

Wiley

Reference36 articles.

1. R.Girshick J.Donahue T.Darrell J.Malik Proceed. IEEE/CVF Conf. on Comp. Vision Pattern Recog. 2014 580.

2. R.Girshick Proceed. IEEE Int. Conf. Comp. Vision 2015 1440.

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3