A Multiscale Parallel Pedestrian Recognition Algorithm Based on YOLOv5

Published:2024-05-20 Issue:10 Volume:13 Page:1989
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Song Qi¹, Zhou ZongHe¹, Ji ShuDe¹, Cui Tong², Yao BuDan³, Liu ZeQi⁴

Affiliation:

1. College of Aerospace Engineering, Shenyang Aerospace University, Shenyang 110136, China

2. College of Artificial Intelligence, Shenyang Aerospace University, Shenyang 110136, China

3. College of Automation, Shenyang Aerospace University, Shenyang 110136, China

4. College of Electrical Engineering, Shanghai Dianji University, Shanghai 201306, China

Abstract

Mainstream pedestrian recognition algorithms have problems such as low accuracy and insufficient real-time performance. In this study, we developed an improved pedestrian recognition algorithm named YOLO-MSP (multiscale parallel) based on residual network ideas, and we improved the network architecture based on YOLOv5s. Three pooling layers were used in parallel in the MSP module to output multiscale features and improve the accuracy of the model while ensuring real-time performance. The Swin Transformer module was also introduced into the network, which improved the efficiency of the model in image processing by avoiding global calculations. The CBAM (Convolutional Block Attention Module) attention mechanism was added to the C3 module, and this new module was named the CBAMC3 module, which improved model efficiency while ensuring the model was lightweight. The WMD-IOU (weighted multidimensional IOU) loss function proposed in this study used the shape change between the recognition frame and the real frame as a parameter to calculate the loss of the recognition frame shape, which could guide the model to better learn the shape and size of the target and optimize recognition performance. Comparative experiments using the INRIA public data set showed that the proposed YOLO-MSP algorithm outperformed state-of-the-art pedestrian recognition methods in accuracy and speed.
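
The parallel-pooling idea in the abstract can be illustrated with a minimal sketch. It is not the authors' MSP module: the abstract gives no kernel sizes, strides, or method for merging the branches, so the kernel sizes (3, 5, 7), the stride of 1, and the helper names `max_pool_same` and `parallel_multiscale_pool` are all assumptions made for illustration. The sketch only shows how three max-pooling branches applied to the same feature map yield outputs of equal spatial size but different receptive fields, which a network could then concatenate along the channel axis.

```python
from typing import List, Sequence

Grid = List[List[float]]


def max_pool_same(feature: Grid, kernel: int) -> Grid:
    """Stride-1 max pooling that keeps the input size.

    The window is clipped at the borders, which for max pooling is
    equivalent to padding with negative infinity.
    """
    if kernel < 1 or kernel % 2 == 0:
        raise ValueError("kernel must be a positive odd integer")
    height, width = len(feature), len(feature[0])
    radius = kernel // 2
    pooled: Grid = []
    for i in range(height):
        row = []
        for j in range(width):
            window = [
                feature[y][x]
                for y in range(max(0, i - radius), min(height, i + radius + 1))
                for x in range(max(0, j - radius), min(width, j + radius + 1))
            ]
            row.append(max(window))
        pooled.append(row)
    return pooled


def parallel_multiscale_pool(
    feature: Grid, kernels: Sequence[int] = (3, 5, 7)
) -> List[Grid]:
    """Apply several pooling branches to the SAME input (hypothetical kernel sizes).

    Each branch sees the original feature map, not the output of the
    previous branch, so the branches are parallel rather than cascaded.
    """
    return [max_pool_same(feature, k) for k in kernels]


if __name__ == "__main__":
    # A 5x5 map with a single activation in the center.
    fmap = [[0.0] * 5 for _ in range(5)]
    fmap[2][2] = 1.0
    for k, branch in zip((3, 5, 7), parallel_multiscale_pool(fmap)):
        print(f"kernel={k}")
        for row in branch:
            print(row)
```

Because every branch preserves the spatial size, the three outputs can be stacked as extra channels without resampling; larger kernels spread a local activation over a wider neighborhood, which is the sense in which the branches carry features at different scales.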

Funder

The State Key Laboratory of Robotics

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/10/1989/pdf
