PBA-YOLOv7: An Object Detection Method Based on an Improved YOLOv7 Network

Published:2023-09-18 Issue:18 Volume:13 Page:10436
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Sun Yang¹, Li Yi¹, Li Song¹, Duan Zehao¹, Ning Haonan¹, Zhang Yuhang¹

Affiliation:

1. College of Mechanical and Equipment Engineering, Hebei University of Engineering, Handan 056038, China

Abstract

Deep learning-based object detection methods must trade off detection accuracy against detection speed. This paper proposes PBA-YOLOv7, an algorithm based on the YOLOv7 network. First, PConv is introduced to lighten the ELAN module in the backbone network and reduce the number of parameters, which improves the detection speed of the network. Second, the BiFusionNet network is designed and introduced to better aggregate high-level and low-level semantic features. Finally, on this basis, the coordinate attention mechanism is introduced so that the network focuses more on important feature information and improves its feature expression ability without increasing model complexity. Experiments on the publicly available KITTI dataset show that the PBA-YOLOv7 model significantly improves both detection accuracy and detection speed compared to the original YOLOv7 model, with improvements of 4% and 7.8% in mAP0.5 and mAP0.5:0.95, respectively, and a gain of six frames per second (FPS). The improved algorithm balances detection accuracy and detection speed in the detection task and performs well compared to other algorithms, such as YOLOv7 and YOLOv5l.
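
The parameter reduction that the abstract attributes to PConv can be illustrated with a small calculation. This is a minimal sketch, not the paper's implementation: it assumes PConv means a partial convolution that applies a k x k convolution to only a fraction of the channels and passes the rest through unchanged, and the 1/4 ratio, the 256-channel width, and the helper names `conv_params` and `pconv_params` are hypothetical choices for illustration.

```python
def conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weight count of a standard k x k convolution (bias omitted)."""
    return c_in * c_out * k * k


def pconv_params(channels: int, k: int, ratio: float = 0.25) -> int:
    """Weight count of a partial convolution (assumed design).

    Only the first `ratio` share of the channels is convolved;
    the remaining channels are passed through and add no weights.
    """
    c_p = int(channels * ratio)  # number of channels actually convolved
    return conv_params(c_p, c_p, k)


# Hypothetical layer: 256 channels, 3 x 3 kernel.
channels, k = 256, 3
full = conv_params(channels, channels, k)
partial = pconv_params(channels, k)
print(f"standard conv: {full} weights")
print(f"partial conv:  {partial} weights")
print(f"reduction:     {full // partial}x")
```

Under these assumptions, the convolved layer needs 1/16 of the weights of a standard convolution of the same width, which is the kind of saving that would lighten a module such as ELAN. The actual ratio and layer sizes used in PBA-YOLOv7 are not given in the abstract.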

Funder

Natural Science Foundation of Hebei Province

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/13/18/10436/pdf

Cited by 3 articles.

1. Multi-Image Fusion-Based Defect Detection Method for Real-Time Monitoring of Recoating in Ceramic Additive Manufacturing;3D Printing and Additive Manufacturing;2024-08-29

2. Lightweight wildfire smoke monitoring algorithm based on unmanned aerial vehicle vision;Signal, Image and Video Processing;2024-06-28

3. An Improved YOLOv7-Based Model for Real-Time Meter Reading with PConv and Attention Mechanisms;Sensors;2024-05-31