A Robust Lightweight Network for Pedestrian Detection Based on YOLOv5-x

Author:

Chen Yuanjie 1, Wang Chunyuan 1, Zhang Chi 2

Affiliation:

1. The College of Electronic and Electrical Engineering, Shanghai University of Engineering Science, Shanghai 201620, China

2. Department of Ethnic Music, Shanghai Conservatory of Music, Shanghai 200031, China

Abstract

Pedestrian detection is a crucial task in computer vision, with applications in surveillance, autonomous driving, and robotics. However, detecting pedestrians in complex scenarios, such as rainy days, remains challenging because image quality degrades and pedestrians are often occluded. To address this issue, we propose RSTDet-Lite, a robust lightweight network for pedestrian detection on rainy days, based on an improved version of YOLOv5-x. Specifically, to reduce the redundant parameters of the YOLOv5-x backbone network and enhance its feature extraction capability, we propose CBP-GNet, a novel network that incorporates a compact bilinear pooling algorithm. Used as the new backbone, CBP-GNet significantly reduces the parameter count and enhances the fine-grained feature fusion capability of the network. Additionally, we introduce Simple-BiFPN, a structure based on the weighted bidirectional feature pyramid, to replace the original feature pyramid module and further improve feature fusion efficiency. To enhance network performance, we integrate the CBAM attention mechanism and introduce the idea of structural reparameterization. To evaluate our method, we create a new dataset named RainDet3000, which consists of 3000 images captured in various rainy scenarios. The experimental results demonstrate that, compared with YOLOv5, our proposed model reduces the network size by 30 M while achieving a 4.56% increase in mAP. This confirms the effectiveness of RSTDet-Lite for rainy-day pedestrian detection.
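The weighted bidirectional feature pyramid mentioned in the abstract is commonly described as fusing its inputs with learnable, non-negative weights that are normalized by their sum. The sketch below illustrates that general fusion rule only; it is not the authors' Simple-BiFPN implementation, whose details the abstract does not give. The function name `fast_normalized_fusion`, the `eps` default, and the use of flat lists in place of feature maps are all assumptions made for illustration.

```python
from typing import List, Sequence


def fast_normalized_fusion(
    features: Sequence[Sequence[float]],
    raw_weights: Sequence[float],
    eps: float = 1e-4,
) -> List[float]:
    """Fuse same-length feature vectors with normalized non-negative weights.

    Hypothetical helper: computes sum_i(w_i * f_i) / (eps + sum_j(w_j)),
    where each w_i is clamped to be non-negative (a ReLU on the weight).
    """
    if len(features) != len(raw_weights):
        raise ValueError("need exactly one weight per input feature")
    if not features:
        raise ValueError("need at least one input feature")

    length = len(features[0])
    if any(len(f) != length for f in features):
        raise ValueError("all input features must have the same length")

    # Clamp weights at zero so no input can contribute negatively.
    weights = [max(0.0, w) for w in raw_weights]
    # eps keeps the division stable when every weight is clamped to zero.
    total = sum(weights) + eps

    return [
        sum(w * f[i] for w, f in zip(weights, features)) / total
        for i in range(length)
    ]


if __name__ == "__main__":
    # Two toy "feature maps" flattened to vectors, with unequal weights.
    shallow = [1.0, 2.0, 3.0]
    deep = [3.0, 2.0, 1.0]
    print(fast_normalized_fusion([shallow, deep], [0.5, 1.5]))
```

In a real detector the inputs would be multi-channel feature maps resized to a common resolution, and the weights would be trained together with the rest of the network; the normalization step is the same.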

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

