Affiliation:
1. School of Automotive and Traffic Engineering, Jiangsu University, Zhenjiang 212013, China
Abstract
Pavement defect detection technology stands as a pivotal component within intelligent driving systems, demanding heightened precision and rapid detection rates. Addressing the complexities arising from diverse defect types and intricate backgrounds in visual sensing, this study introduces an enhanced approach to augment the network structure and activation function within the foundational YOLOv5 algorithm. Initially, modifications to the YOLOv5′s architecture incorporate an adjustment to the Leaky ReLU activation function, thereby enhancing regression stability and accuracy. Subsequently, the integration of bi-level routing attention into the network’s head layer optimizes the attention mechanism, notably improving overall efficiency. Additionally, the replacement of the YOLOv5 backbone layer’s C3 module with the C3-TST module enhances initial convergence efficiency in target detection. Comparative analysis against the original YOLOv5s network reveals a 2% enhancement in map50 and a 1.8% improvement in F1, signifying an overall advancement in network performance. The initial convergence rate of the algorithm has been improved, and the accuracy and operational efficiency have also been greatly improved, especially on models with small-scale training sets.
Funder
National Natural Science Foundation
Reference40 articles.
1. Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 17–21). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Seattle, VA, USA.
2. Demasi, F., Loprencipe, G., and Moretti, L. (2018). Road safety analysis of urban roads: Case study of an Italian municipality. Safety, 4.
3. Viola, P., and Jones, M. (2001, January 7–14). Robust real-time face detection. Proceedings of the Procedures Eight IEEE International Conference on Computer Vision ICCV 2001, Vancouver, BC, Canada.
4. Comparison of Regularization Methods for Imagenet Classification with Deep Revolutionary Neural Networks;Smirnov;AASRI Procedia,2014
5. Girshock, r. (2015, January 7–13). Fast R-CNN. Proceedings of the Procedures of IEEE International Conference on Computer Vision, Washington, DC, USA.