Affiliation:
1. Hohai University
2. Suzhou University of Science and Technology
3. Tongji University
Abstract
Abstract
This study presents a comprehensive analysis and improvement of the YOLOv8-n algorithm for object detection, focusing on the integration of Wasserstein Distance Loss, FasterNext, and Context Aggravation strategies. Through a detailed ablation study, each strategy was systematically evaluated individually and collectively to assess its contribution to the model's performance. The results indicate that each strategy uniquely enhances the model's performance, significantly increasing mAP and reducing model complexity when all three are integrated. Visualizations through Grad-CAM further substantiate the improved model's capacity to extract and focus on key object features. Comparisons with existing models, such as YOLOv5-n, YOLOv5-s, YOLOX-n, YOLOX-s, and YOLOv7-tiny, the improved YOLOv8-n model achieves an optimal balance between accuracy and model complexity, outperforming other models in terms of model accuracy, model complexity, and model inference speed. Further image inference tests validate the model's performance, showcasing its superior detection capabilities.
Publisher
Research Square Platform LLC
Reference22 articles.
1. A Survey of the Four Pillars for Small Object Detection: Multi-scale Representation, Contextual Information, Super-Resolution, and Region Proposal;Chen G;IEEE T. Syst Man Cy-S.,2022
2. Lin, T-Y., Dollár, P., Girshick, R., He, K., Hariharan, B., Belongie, S.: Feature Pyramid Networks for Object Detection. arXiv preprint arXiv:1612.03144 (2017)
3. Hierarchical Objectness Network for Region Proposal Generation and Object Detection;Wang J;Pattern Recogn.,2018
4. ReDet: A Rotation-Equivariant Detector for Aerial Object Detection;Han J,2021
5. Oriented Bounding Boxes for Small and Freely Rotated Objects;Zand M;IEEE T. Geosci Remote,2022
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献