YOLOv7 Optimization Model Based on Attention Mechanism Applied in Dense Scenes
-
Published:2023-08-11
Issue:16
Volume:13
Page:9173
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Wang Jiabao1, Wu Jun1, Wu Junwei1, Wang Jiangpeng1, Wang Ji1
Affiliation:
1. School of Computer Science, Hubei University of Technology, Wuhan 430068, China
Abstract
With object detection technology, real-time detection of dense scenes has become an important application requirement in various industries, which is of great significance for improving production efficiency and ensuring public safety. However, the current mainstream target detection algorithms have problems such as insufficient accuracy or inability to achieve real-time detection when detecting dense scenes, and to address this problem this paper improves the YOLOv7 model using attention mechanisms that can enhance critical information. Based on the original YOLOv7 network model, part of the traditional convolutional layers are replaced with the standard convolution combined with the attention mechanism. After comparing the optimization results of three different attention mechanisms, CBAM, CA, and SimAM, the YOLOv7B-CBAM model is proposed, which effectively improves the accuracy of object detection in dense scenes. The results on VOC datasets show that the YOLOv7B-CBAM model has the highest accuracy, reaching 87.8%, 1.5% higher than that of the original model, and outperforms the original model as well as other models with improved attention mechanisms in the subsequent results of two other different dense scene practical application scenarios. This model can be applied to public safety detection, agricultural detection, and other fields, saving labor costs, improving public health, reducing the spread and loss of plant diseases, and realizing high-precision, real-time target detection.
Funder
National Natural Science Foundation of China Hubei Province Science and Technology Support Project
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference45 articles.
1. RGB-D Image Multi-Target Detection Method Based on 3D DSF R-CNN;Hu;Int. J. Pattern Recognit. Artif. Intell.,2019 2. Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27–30). You Only Look Once: Unified, Real-Time Object Detection. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA. 3. Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016). European Conference on Computer Vision, Springer. 4. Redmon, J., and Farhadi, A. (2017, January 21–26). YOLO9000: Better, Faster, Stronger. Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA. 5. Focal Loss for Dense Object Detection;Lin;IEEE Trans. Pattern Anal. Mach. Intell.,2020
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. AI-enhanced real-time cattle identification system through tracking across various environments;Scientific Reports;2024-08-01 2. Dam surface crack detection based on SE-YOLOv7;Third International Conference on Electronic Information Engineering and Data Processing (EIEDP 2024);2024-07-05 3. High-Speed Motion Target Real-Time Detection Based on Lightweight Deep Feature Learning Network;IEEE Sensors Journal;2024-06-15 4. Dam Surface Crack Detection Based on CMAM-YOLOv7;2024 5th International Seminar on Artificial Intelligence, Networking and Information Technology (AINIT);2024-03-29 5. Collision Detection and Prevention for Automobiles using Machine Learning;2024 International Conference on Trends in Quantum Computing and Emerging Business Technologies;2024-03-22
|
|