An Improved YOLOv5 Underwater Detector Based on an Attention Mechanism and Multi-Branch Reparameterization Module
Published: 2023-06-08
Issue: 12
Volume: 12
Page: 2597
ISSN: 2079-9292
Container-title: Electronics
Language: en
Short-container-title: Electronics
Author:
Zhang Jian [1,2], Chen Hongda [2], Yan Xinyue [2], Zhou Kexin [2], Zhang Jinshuai [2], Zhang Yonghui [1], Jiang Hong [1], Shao Bingqian [2]
Affiliation:
1. School of Information and Communication Engineering, Hainan University, Haikou 570228, China
2. School of Applied Science and Technology, Hainan University, Haikou 570228, China
Abstract
Underwater target detection is a critical task in various applications, including environmental monitoring, underwater exploration, and marine resource management. As the demand for underwater observation and exploitation continues to grow, there is a greater need for reliable and efficient methods of detecting underwater targets. However, the unique underwater environment often leads to significant degradation of the image quality, which results in reduced detection accuracy. This paper proposes an improved YOLOv5 underwater-target-detection network to enhance accuracy and reduce missed detection. First, we added the global attention mechanism (GAM) to the backbone network, which could retain the channel and spatial information to a greater extent and strengthen cross-dimensional interaction so as to improve the ability of the backbone network to extract features. Then, we introduced the fusion block based on DAMO-YOLO for the neck, which enhanced the system’s ability to extract features at different scales. Finally, we used the SIoU loss to measure the degree of matching between the target box and the regression box, which accelerated the convergence and improved the accuracy. The results obtained from experiments on the URPC2019 dataset revealed that our model achieved an mAP@0.5 score of 80.2%, representing a 1.8% and 2.3% increase in performance compared to YOLOv7 and YOLOv8, respectively, which means our method achieved state-of-the-art (SOTA) performance. Moreover, additional evaluations on the MS COCO dataset indicated that our model’s mAP@0.5:0.95 reached 51.0%, surpassing advanced methods such as ViDT and RF-Next, demonstrating the versatility of our enhanced model architecture.
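The mAP@0.5 and mAP@0.5:0.95 figures quoted in the abstract both rest on intersection over union (IoU) between a predicted box and a ground-truth box: a prediction counts as a match at mAP@0.5 when its IoU is at least 0.5, and mAP@0.5:0.95 averages over IoU thresholds from 0.5 to 0.95. The following is a minimal sketch of plain IoU only, not the SIoU loss used in the paper, which adds further penalty terms on top of the overlap. The function names and the (x1, y1, x2, y2) box layout are assumptions made for illustration.

```python
def iou(box_a, box_b):
    """Plain IoU of two axis-aligned boxes given as (x1, y1, x2, y2).

    Illustrative only: this is not the SIoU loss from the paper.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlap rectangle (zero if the boxes are disjoint).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter

    return inter / union if union > 0 else 0.0


def matches_at_threshold(pred_box, gt_box, threshold=0.5):
    """Hypothetical helper: does a prediction match a ground-truth box at this IoU threshold?"""
    return iou(pred_box, gt_box) >= threshold


if __name__ == "__main__":
    pred = (0.0, 0.0, 10.0, 10.0)
    gt = (0.0, 0.0, 10.0, 8.0)
    print(iou(pred, gt))
    print(matches_at_threshold(pred, gt, threshold=0.5))
```

A full mAP computation additionally ranks predictions by confidence and integrates a precision-recall curve per class; that part is omitted here.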
Funder
Key Research and Development Project of Hainan Province; Hainan Provincial Natural Science Foundation of China
Subject
Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering
References: 75 articles.
Cited by
8 articles.