Efficient Small-Object Detection in Underwater Images Using the Enhanced YOLOv8 Network

Published:2024-01-27 Issue:3 Volume:14 Page:1095
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Zhang Minghua¹, Wang Zhihua¹, Song Wei¹, Zhao Danfeng¹, Zhao Huijuan¹

Affiliation:

1. College of Information Technology, Shanghai Ocean University, Shanghai 201306, China

Abstract

Underwater object detection plays a significant role in marine ecosystem research and marine species conservation, so improving the related technologies has practical significance. Although existing object-detection algorithms perform excellently on land, they are unsatisfactory in underwater scenarios for two reasons: underwater objects are often small, densely distributed, and prone to occlusion, and underwater embedded devices have limited storage and computational capabilities. In this paper, we propose a high-precision, lightweight detector, based on the You Only Look Once Version 8 (YOLOv8) model, that is specifically optimized for underwater scenarios. First, we replace the Darknet-53 backbone of YOLOv8s with FasterNet-T0, reducing model parameters by 22.52%, FLOPs by 23.59%, and model size by 22.73%. Second, we add a prediction head for small objects, increase the number of channels in the detection heads for high-resolution feature maps, and decrease the number of channels in the detection heads for low-resolution feature maps. This yields a 1.2% improvement in small-object detection accuracy while leaving the model's parameter count and memory consumption nearly unchanged. Third, we use Deformable ConvNets and Coordinate Attention in the neck to improve detection accuracy for irregularly shaped and densely occluded small targets, by learning convolution offsets from feature maps and emphasizing the regions of interest (RoIs). Our method achieves 52.12% AP on the underwater dataset UTDAC2020 with only 8.5 M parameters, 25.5 B FLOPs, and a 17 MB model size. It surpasses the large model YOLOv8l, which reaches 51.69% AP with 43.6 M parameters, 164.8 B FLOPs, and an 84 MB model size.
Furthermore, by increasing the input image resolution to 1280 × 1280 pixels, our model achieves 53.18% AP, making it the state-of-the-art (SOTA) model for the UTDAC2020 underwater dataset. Additionally, we achieve 84.4% mAP on the Pascal VOC dataset, with a substantial reduction in model parameters compared to previous, well-established detectors. The experimental results demonstrate that our proposed lightweight method retains effectiveness on underwater datasets and can be generalized to common datasets.
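The size and accuracy comparison with YOLOv8l quoted in the abstract can be turned into relative figures with a few lines of arithmetic. The sketch below is illustrative only: the `ModelStats` class and the helper names are hypothetical and not part of the paper, and the only inputs are the numbers stated in the abstract (AP on UTDAC2020, parameters in millions, FLOPs in billions, model size in MB).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelStats:
    """Hypothetical container for the figures quoted in the abstract."""
    name: str
    ap_percent: float       # AP on UTDAC2020, in percent
    params_millions: float  # number of parameters, in millions
    flops_billions: float   # floating-point operations, in billions
    size_mb: float          # model file size, in MB


# Values copied from the abstract.
PROPOSED = ModelStats("proposed", 52.12, 8.5, 25.5, 17.0)
YOLOV8L = ModelStats("YOLOv8l", 51.69, 43.6, 164.8, 84.0)


def reduction_percent(small: float, large: float) -> float:
    """Percentage by which `small` is lower than `large`."""
    return 100.0 * (1.0 - small / large)


def compare(model: ModelStats, baseline: ModelStats) -> dict:
    """Summarise how `model` differs from `baseline`."""
    return {
        "ap_gain_points": round(model.ap_percent - baseline.ap_percent, 2),
        "params_reduction_pct": round(
            reduction_percent(model.params_millions, baseline.params_millions), 1),
        "flops_reduction_pct": round(
            reduction_percent(model.flops_billions, baseline.flops_billions), 1),
        "size_reduction_pct": round(
            reduction_percent(model.size_mb, baseline.size_mb), 1),
    }


if __name__ == "__main__":
    for key, value in compare(PROPOSED, YOLOV8L).items():
        print(f"{key}: {value}")
```

On these figures, the proposed model is 0.43 AP points ahead of YOLOv8l while using roughly 80% fewer parameters, about 85% fewer FLOPs, and a model file about 80% smaller.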

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Link

https://www.mdpi.com/2076-3417/14/3/1095/pdf

References: 63 articles.

1. Active underwater detection with an array of atomic magnetometers;Deans;Appl. Opt.,2018

2. Deep sea habitats in the chemical warfare dumping areas of the Baltic Sea;Czub;Sci. Total Environ.,2018

3. Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27–30). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.

4. Spatial pyramid pooling in deep convolutional networks for visual recognition;He;IEEE Trans. Pattern Anal. Mach. Intell.,2015

5. He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27–30). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.

Cited by 5 articles.

1. Application of Target Detection Based on Deep Learning in Intelligent Mineral Identification;Minerals;2024-08-27

2. Research on Infrared Dim Target Detection Based on Improved YOLOv8;Remote Sensing;2024-08-07

3. A Precise Plot-Level Rice Yield Prediction Method Based on Panicle Detection;Agronomy;2024-07-24

4. Deep Learning Test Platform for Maritime Applications: Development of the eM/S Salama Unmanned Surface Vessel and Its Remote Operations Center for Sensor Data Collection and Algorithm Development;Remote Sensing;2024-04-26

5. Improved YOLOv8 Model for a Comprehensive Approach to Object Detection and Distance Estimation;IEEE Access;2024