Classifying marine mammals signal using cubic splines interpolation combining with triple loss variational auto-encoder-Reference-Cited by-同舟云学术

Classifying marine mammals signal using cubic splines interpolation combining with triple loss variational auto-encoder

Published:2023-11-15 Issue:1 Volume:13 Page:
ISSN:2045-2322
Container-title:Scientific Reports
language:en
Short-container-title:Sci Rep

Author:

Bach Nhat Hoang,Vu Le Ha,Nguyen Van Duc,Pham Duy Phong

Abstract

AbstractIn practical applications of passive sonar principles for extracting characteristic frequencies of acoustic signals, scientists typically employ traditional time-frequency domain transformation methods such as Mel-frequency, Short time Fourier transform (STFT), and Wavelet transform (WT). However, these solutions still face limitations in resolution and information loss when transforming data collected over extended periods. In this paper, we present a study using a two-stage approach that combines pre-processing by Cubic-splines interpolation (CSI) with a probability distribution in the hidden space with Siamese triple loss network model for classifying marine mammal (MM) communication signals. The Cubic-splines interpolation technique is tested with the STFT transformation to generate STFT-CSI spectrograms, which enforce stronger relationships between characteristic frequencies, enhancing the connectivity of spectrograms and highlighting frequency-based features. Additionally, stacking spectrograms generated by three consecutive methods, Mel, STFT-CSI, and Wavelet, into a feature spectrogram optimizes the advantages of each method across different frequency bands, resulting in a more effective classification process. The proposed solution using an Siamese Neural Network-Variational Auto Encoder (SNN-VAE) model also overcomes the drawbacks of the Auto-Encoder (AE) structure, including loss of discontinuity and loss of completeness during decoding. The classification accuracy of marine mammal signals using the SNN-VAE model increases by 11% and 20% compared to using the AE model (2013), and by 6% compared to using the Resnet model (2022) on the same actual dataset NOAA from the National Oceanic and Atmospheric Administration - United State of America.

Publisher

Springer Science and Business Media LLC

Subject

Multidisciplinary

Link

https://www.nature.com/articles/s41598-023-47320-4.pdf

Reference74 articles.

1. D’Amico, A. et al. Beaked whale strandings and naval exercises (Tech. Rep, SPACE AND NAVAL WARFARE SYSTEMS CENTER SAN DIEGO CA, 2009).

2. Ketten, D. Sonars and strandings: Are beaked whales the aquatic acoustic canary. Acoust. Today 10, 46–56 (2014).

3. Clark, C., Marler, P. & Beeman, K. Quantitative analysis of animal vocal phonology: An application to swamp sparrow song. Ethology 76, 101–115 (1987).

4. Nhat, H. B. et al. Optimizing baseline in usbl using costas hopping to increase navigation precision in shallow water. In 2022 16th International Conference on Ubiquitous Information Management and Communication (IMCOM), 1–6 (IEEE, 2022).

5. Roch, M. et al. Classification of echolocation clicks from odontocetes in the southern california bight. J. Acoust. Soc. Am. 129, 467–475 (2011).

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Perspective Chapter: Enhancing Regression Analysis with Splines and Machine Learning – Evaluation of How to Capture Complex Non-Linear Multidimensional Variables;Nonlinear Systems and Matrix Analysis - Recent Advances in theory and Applications [Working Title];2024-09-11

2. Risk and impact-centered non-stationary signal analysis based on fault signatures for Djibouti power system;Electrical Engineering;2024-03-29