Abstract
AbstractPassive acoustic monitoring (PAM) is a common approach to monitor marine mammal populations, for species of dolphins, porpoises and whales that use sound for navigation, feeding and communication. PAM produces large datasets which benefit from the application of machine learning algorithms to automatically detect and classify the vocalisations of these animals. We present a deep learning approach for the classification of dolphins’ echolocation clicks into two species groups in an environment with high background noise. We compare the use of Convolutional Neural Networks (CNN) and Recurrent Neural Network (RNN), in which we feed the models the raw waveform data and spectrograms. We show that both models perform well, with the highest performance achieved by a CNN fed with spectrograms (F1 score 97 %) and an RNN fed with raw data (F1 score 96%) fitted with Gated Recurrent Units (GRU). We recommend the use of such models to classify echolocation clicks in marine environments where background noise levels exhibit high spatial and temporal variance. In particular, the RNN showed excellent performance, while being fed with raw data, in terms of reduced processing time and storage. Deep learning automatically extracts effective features from the raw waveform in the training process through multiple layers of the model, without the need to rely on feature extraction in a separate pre-processing step.
Publisher
Cold Spring Harbor Laboratory
Reference42 articles.
1. A Convolutional Neural Network for Automated Detection of Humpback Whale Song in a Diverse, Long-Term Passive Acoustic Dataset
2. Hearing by Whales and Dolphins
3. Signal Processing and Signal Processing Models
4. Au, W. W. L. (2002). “Echolocation,” In W. F. Perrin , B. Wursig , and J. G. M. Thewissen (Eds.), Encyclopedia of Marine Mammals, Academic Press, San Diego, USA, pp. 358–367.
5. Automatic gain control in the echolocation system of dolphins
Cited by
24 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献