A Multiscale Parallel Pedestrian Recognition Algorithm Based on YOLOv5

Author:

Song Qi1ORCID,Zhou ZongHe1ORCID,Ji ShuDe1,Cui Tong2ORCID,Yao BuDan3,Liu ZeQi4

Affiliation:

1. College of Aerospace Engineering, Shenyang Aerospace University, Shenyang 110136, China

2. College of Artificial Intelligence, Shenyang Aerospace University, Shenyang 110136, China

3. College of Automation, Shenyang Aerospace University, Shenyang 110136, China

4. College of Electrical Engineering, Shanghai Dianji University, Shanghai 201306, China

Abstract

Mainstream pedestrian recognition algorithms have problems such as low accuracy and insufficient real-time performance. In this study, we developed an improved pedestrian recognition algorithm named YOLO-MSP (multiscale parallel) based on residual network ideas, and we improved the network architecture based on YOLOv5s. Three pooling layers were used in parallel in the MSP module to output multiscale features and improve the accuracy of the model while ensuring real-time performance. The Swin Transformer module was also introduced into the network, which improved the efficiency of the model in image processing by avoiding global calculations. The CBAM (Convolutional Block Attention Module) attention mechanism was added to the C3 module, and this new module was named the CBAMC3 module, which improved model efficiency while ensuring the model was lightweight. The WMD-IOU (weighted multidimensional IOU) loss function proposed in this study used the shape change between the recognition frame and the real frame as a parameter to calculate the loss of the recognition frame shape, which could guide the model to better learn the shape and size of the target and optimize recognition performance. Comparative experiments using the INRIA public data set showed that the proposed YOLO-MSP algorithm outperformed state-of-the-art pedestrian recognition methods in accuracy and speed.

Funder

The State Key Laboratory of Robotics

Publisher

MDPI AG

Reference39 articles.

1. Dalal, N., and Triggs, B. (2005, January 20–25). Histograms of oriented gradients for human recognition. Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’05), San Diego, CA, USA.

2. Dollár, P., Tu, Z., Perona, P., and Belongie, S. (2009, January 7–10). Integral channel features. Proceedings of the British Machine Vision Conference, BMVC 2009, London, UK.

3. Fast feature pyramids for object recognition;Appel;IEEE Trans. Pattern Anal. Mach. Intell.,2014

4. Gradient-based learning applied to document recognition;LeCun;Proc. IEEE,1998

5. Faster r-cnn: Towards real-time object recognition with region proposal networks;Ren;Adv. Neural Inf. Process. Syst.,2015

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3