Low-Resource Generation Method for Few-Shot Dolphin Whistle Signal Based on Generative Adversarial Network
-
Published:2023-05-22
Issue:5
Volume:11
Page:1086
-
ISSN:2077-1312
-
Container-title:Journal of Marine Science and Engineering
-
language:en
-
Short-container-title:JMSE
Author:
Wang Huiyuan12, Wu Xiaojun2ORCID, Wang Zirui1ORCID, Hao Yukun12ORCID, Hao Chengpeng3ORCID, He Xinyi4, Hu Qiao15ORCID
Affiliation:
1. School of Mechanical Engineering, Xi’an Jiaotong University, Xi’an 710049, China 2. School of Software Engineering, Xi’an Jiaotong University, Xi’an 710049, China 3. Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, China 4. Naval Academy of Armament, Beijing 100161, China 5. Shaanxi Key Laboratory of Intelligent Robots, Xi’an Jiaotong University, Xi’an 710049, China
Abstract
Dolphin signals are effective carriers for underwater covert detection and communication. However, the environmental and cost constraints terribly limit the amount of data available in dolphin signal datasets are often limited. Meanwhile, due to the low computational power and resource sensitivity of Unmanned Underwater Vehicles (UUVs), current methods for real-time generation of dolphin signals with favorable results are still subject to several challenges. To this end, a Masked AutoEncoder Generative Adversarial Network (MAE-GAN) model is hereby proposed. First, considering the few-shot condition, the dataset is extended by using data augmentation techniques. Then, to meet the low arithmetic constraint, a denoising autoencoder with a mask is used to obtain latent codes through self-supervised learning. These latent codes are then utilized in Conditional Wasserstein Generative Adversarial Network-Gradient Penalty (CWGAN-GP) to generate a whistle signal model for the target dataset, fully demonstrating the effectiveness of the proposed method for enhancing dolphin signal generation in data-limited scenarios. The whistle signals generated by the MAE-GAN and baseline models are compared with actual dolphin signals, and the findings indicate that the proposed approach achieves a discriminative score of 0.074, which is 28.8% higher than that of the current state-of-the-art techniques. Furthermore, it requires only 30.2% of the computational resources of the baseline model. Overall, this paper presents a novel approach to generating high-quality dolphin signals in data-limited situations, which can also be deployed on low-resource devices. The proposed MAE-GAN methods provide a promising solution to address the challenges of limited data and computational power in generating dolphin signals.
Funder
Major Program of the National Natural Science Foundation of China General Program of the National Natural Science Foundation of China Basic Research Project of China Rapid Support Fund Project of China
Subject
Ocean Engineering,Water Science and Technology,Civil and Structural Engineering
Reference35 articles.
1. Bionic Covert Underwater Communication Focusing on the Overlapping of Whistles and Clicks Generated by Different Cetacean Individuals;Li;Appl. Acoust.,2021 2. Synthesis and Modification of Cetacean Tonal Sounds for Underwater Bionic Covert Detection and Communication;Jiang;IEEE Access,2020 3. Gregorietti, M., Papale, E., Ceraulo, M., de Vita, C., Pace, D.S., Tranchida, G., Mazzola, S., and Buscaino, G. (2021). Acoustic Presence of Dolphins through Whistles Detection in Mediterranean Shallow Waters. J. Mar. Sci. Eng., 9. 4. Graph-Based Clustering of Dolphin Whistles;Kipnis;IEEE/ACM Trans. Audio Speech Lang. Process.,2021 5. Li, P., Liu, X., Palmer, K.J., Fleishman, E., Gillespie, D., Nosal, E.-M., Shiu, Y., Klinck, H., Cholewiak, D., and Helble, T. (2020, January 19–24). Learning Deep Models from Synthetic Data for Extracting Dolphin Whistle Contours. Proceedings of the 2020 International Joint Conference on Neural Networks (IJCNN), Glasgow, UK.
|
|