Abstract
Sonar image is the main way for underwater vehicles to obtain environmental information. The task of target detection in sonar images can distinguish multi-class targets in real time and accurately locate them, providing perception information for the decision-making system of underwater vehicles. However, there are many challenges in sonar image target detection, such as many kinds of sonar, complex and serious noise interference in images, and less datasets. This paper proposes a sonar image target detection method based on Dual Path Vision Transformer Network (DP-VIT) to accurately detect targets in forward-look sonar and side-scan sonar. DP-ViT increases receptive field by adding multi-scale to patch embedding enhances learning ability of model feature extraction by using Dual Path Transformer Block, then introduces Conv-Attention to reduce model training parameters, and finally uses Generalized Focal Loss to solve the problem of imbalance between positive and negative samples. The experimental results show that the performance of this sonar target detection method is superior to other mainstream methods on both forward-look sonar dataset and side-scan sonar dataset, and it can also maintain good performance in the case of adding noise.
Funder
Science and Technology Project of Shaanxi Province Yinhan Jiwei Engineering Construction Co., Ltd.
Shaanxi Provincial Water Conservancy Science and Technology Program
Heilongjiang Provincial Natural Science Foundation
Acoustics Science and Technology Laboratory
Subject
General Earth and Planetary Sciences
Reference42 articles.
1. Submarine pipeline tracking technology based on AUVs with forward looking sonar;Appl. Ocean Res.,2022
2. A Review of Current Research and Advances in Unmanned Surface Vehicles;J. Mar. Sci. Appl.,2022
3. Automatic Target Recognition for Mine Countermeasure Missions Using Forward-Looking Sonar Data;IEEE J. Ocean Eng.,2021
4. Tang, Y., Jin, S., Xiao, F., Bian, G., and Zhang, Y. (2020, January 23–25). Recognition of Side-scan Sonar Shipwreck Image Using Convolutional Neural Network. Proceedings of the 2020 2nd International Conference on Machine Learning, Big Data and Business Intelligence (MLBDBI), Taiyuan, China.
5. Grządziel, A. (2022). Application of Remote Sensing Techniques to Identification of Underwater Airplane Wreck in Shallow Water Environment: Case Study of the Baltic Sea, Poland. Remote Sens., 14.
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献