Underwater Side-Scan Sonar Target Detection: YOLOv7 Model Combined with Attention Mechanism and Scaling Factor
Published:2024-07-08
Issue:13
Volume:16
Page:2492
ISSN:2072-4292
Container-title:Remote Sensing
language:en
Short-container-title:Remote Sensing
Author:
Wen Xin1, Wang Jian2, Cheng Chensheng1, Zhang Feihu1, Pan Guang1
Affiliation:
1. School of Marine Science and Technology, Northwestern Polytechnical University, Xi’an 710072, China
2. Marine Design & Research Institute of China, Shanghai 200011, China
Abstract
Side-scan sonar plays a crucial role in underwater exploration, and autonomous target detection in side-scan sonar images is vital for exploring unknown underwater environments. However, the complexity of the underwater environment, the small number of highlighted areas on targets, blurred feature details, and the difficulty of collecting side-scan sonar data make high-precision autonomous target recognition in these images challenging. This article addresses the problem by improving the You Only Look Once v7 (YOLOv7) model to achieve high-precision object detection in side-scan sonar images. First, because side-scan sonar images contain large areas of irrelevant information, this paper introduces the Swin-Transformer for dynamic attention and global modeling, which strengthens the model’s focus on target regions. Second, the Convolutional Block Attention Module (CBAM) is used to further improve feature representation and enhance the model’s accuracy. Finally, to address the uncertainty of the geometric features of side-scan sonar targets, this paper incorporates a feature scaling factor into the YOLOv7 model. Experiments on a public dataset first verify the necessity of the attention mechanisms. Subsequent experiments on our side-scan sonar (SSS) image dataset show that the improved YOLOv7 model achieves a mean average precision of 87.9% at mAP0.5 and 49.23% at mAP0.5:0.95, which is 9.28% and 8.41% higher, respectively, than the YOLOv7 model. The improved YOLOv7 algorithm proposed in this paper has great potential for object detection and recognition in side-scan sonar images.
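The two metrics quoted in the abstract differ only in the intersection-over-union (IoU) threshold used to decide whether a predicted box counts as a correct detection. The following is a minimal sketch of that distinction, not the authors' evaluation code. It assumes the common convention that mAP0.5 uses a single IoU threshold of 0.5 and mAP0.5:0.95 averages over ten thresholds from 0.50 to 0.95 in steps of 0.05, and it is simplified to a single class on a single image. The names `iou`, `average_precision`, and `map_50_95` are hypothetical and chosen for illustration.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), an assumed box format


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def average_precision(preds: List[Tuple[float, Box]], gts: List[Box], thr: float) -> float:
    """AP for one class at IoU threshold `thr` (area under the precision-recall curve).

    `preds` holds (confidence, box) pairs; `gts` holds ground-truth boxes.
    """
    if not gts:
        return 0.0
    ranked = sorted(preds, key=lambda p: p[0], reverse=True)
    matched = [False] * len(gts)
    hits: List[bool] = []
    for _, box in ranked:
        # Greedily match each prediction to the best still-unmatched ground truth.
        best_iou, best_j = 0.0, -1
        for j, gt in enumerate(gts):
            if not matched[j]:
                v = iou(box, gt)
                if v > best_iou:
                    best_iou, best_j = v, j
        hit = best_j >= 0 and best_iou >= thr
        if hit:
            matched[best_j] = True
        hits.append(hit)

    # Running precision and recall down the ranked list.
    tp = 0
    precisions: List[float] = []
    recalls: List[float] = []
    for k, hit in enumerate(hits, start=1):
        tp += int(hit)
        precisions.append(tp / k)
        recalls.append(tp / len(gts))

    # Make precision non-increasing in recall, then integrate.
    for k in range(len(precisions) - 2, -1, -1):
        precisions[k] = max(precisions[k], precisions[k + 1])
    ap, prev_recall = 0.0, 0.0
    for p, r in zip(precisions, recalls):
        ap += (r - prev_recall) * p
        prev_recall = r
    return ap


def map_50_95(preds: List[Tuple[float, Box]], gts: List[Box]) -> float:
    """Mean AP over IoU thresholds 0.50, 0.55, ..., 0.95 (single class)."""
    thresholds = [(50 + 5 * i) / 100 for i in range(10)]
    return sum(average_precision(preds, gts, t) for t in thresholds) / len(thresholds)


if __name__ == "__main__":
    ground_truth = [(0.0, 0.0, 10.0, 10.0)]
    predictions = [(0.9, (0.0, 0.0, 10.0, 8.0))]  # overlaps the target with IoU 0.8
    print("AP at IoU 0.5:", average_precision(predictions, ground_truth, 0.5))
    print("AP averaged over 0.5:0.95:", map_50_95(predictions, ground_truth))
```

In this toy case the single prediction overlaps the target with an IoU of 0.8, so it counts as correct at the looser thresholds but not at the strictest ones. That is why a stricter averaged metric such as mAP0.5:0.95 is typically lower than mAP0.5, as in the figures reported above.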
Funder
National Key R&D Program of China