Improved YOLOv7 for Small Object Detection Algorithm Based on Attention and Dynamic Convolution

Published: 2023-08-16 Issue: 16 Volume: 13 Page: 9316
ISSN: 2076-3417
Container-title: Applied Sciences
language: en
Short-container-title: Applied Sciences

Author:

Li Kai¹, Wang Yanni¹, Hu Zhongmian¹

Affiliation:

1. College of Information and Control Engineering, Xi’an University of Architecture and Technology, Xi’an 710311, China

Abstract

The rapid advancement of deep learning has significantly accelerated progress in target detection. However, detecting small targets remains challenging because they are sensitive to size variations. In this paper, we address these challenges by building on YOLOv7, the latest version of the You Only Look Once (YOLO) model. Our approach enhances YOLOv7 to better preserve features and minimize feature loss during network processing. We introduce the Spatial Pyramid Pooling and Cross-Stage Partial Channel (SPPCSPC) module, which combines the ideas of feature separation and merging. To mitigate missed detections of small targets and reduce the impact of noise, we place Coordinate Attention (CA) modules at selected positions in the network. Additionally, we introduce a dynamic convolution module to address the false and missed detections caused by large variations in target size, which improves network robustness. We validated the approach experimentally on the FloW-Img sub-dataset provided by Okahublot. The results show that the enhanced YOLOv7 model outperforms the original network, with a marked reduction in missed detections and a mean Average Precision (mAP) of 81.1%, which is 5.2 percentage points higher than the baseline YOLOv7 model. The new model also compares favorably in some respects with recent small-target-detection algorithms such as FCOS and VFNet.
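The abstract reports the gain in percentage points, which is not the same as a relative percentage. As a small worked check, the sketch below derives the baseline mAP implied by the two quoted figures and the corresponding relative improvement. Only 81.1 and 5.2 come from the abstract; the variable names are illustrative, and the baseline value is derived here, not quoted from the paper.

```python
# Figures quoted in the abstract.
improved_map = 81.1  # mAP of the enhanced YOLOv7, in percent
gain_points = 5.2    # reported gain over the baseline, in percentage points

# Baseline mAP implied by the two figures (derived, not quoted).
baseline_map = improved_map - gain_points

# Relative improvement with respect to the implied baseline, in percent.
relative_gain = gain_points / baseline_map * 100

print(f"implied baseline mAP: {baseline_map:.1f}%")
print(f"relative improvement: {relative_gain:.1f}%")
```

Under these assumptions the implied baseline is about 75.9% mAP, so the 5.2-point gain corresponds to a relative improvement of roughly 6.9%.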

Funder

Natural Science Foundation of Shaanxi Province, China

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/13/16/9316/pdf

Cited by 19 articles.

1. Underwater Robot Target Detection Algorithm Based on YOLOv8;Electronics;2024-08-25

2. Improved YOLOv8 Algorithm for Water Surface Object Detection;Sensors;2024-08-05

3. CT-YoloTrad: fast and accurate recognition of point-distributed coded targets for UAV images incorporating CT-YOLOv7;Physica Scripta;2024-07-18

4. Enhanced floating debris detection algorithm based on CDW-YOLOv8;Physica Scripta;2024-06-24

5. High-Speed Motion Target Real-Time Detection Based on Lightweight Deep Feature Learning Network;IEEE Sensors Journal;2024-06-15