Convolutional Neural Networks for the Identification of African Lions from Individual Vocalizations

Published: 2022-04-01, Volume: 8, Issue: 4, Page: 96
ISSN: 2313-433X
Container title: Journal of Imaging
Language: en
Short container title: J. Imaging

Author:

Trapanotto Martino, Nanni Loris, Brahnam Sheryl, Guo Xiang

Abstract

The classification of vocal individuality for passive acoustic monitoring (PAM) and census of animals is becoming an increasingly popular area of research. Nearly all studies in this field of inquiry have relied on classic audio representations and classifiers, such as Support Vector Machines (SVMs) trained on spectrograms or Mel-Frequency Cepstral Coefficients (MFCCs). In contrast, most current bioacoustic species classification exploits the power of deep learners and more cutting-edge audio representations. A significant reason for avoiding deep learning in vocal identity classification is the tiny sample size in the collections of labeled individual vocalizations. As is well known, deep learners require large datasets to avoid overfitting. One way to handle small datasets with deep learning methods is to use transfer learning. In this work, we evaluate the performance of three pretrained CNNs (VGG16, ResNet50, and AlexNet) on a small, publicly available lion roar dataset containing approximately 150 samples taken from five male lions. Each of these networks is retrained on eight representations of the samples: MFCCs, spectrogram, and Mel spectrogram, along with several new ones, such as VGGish and Stockwell, and those based on the recently proposed LM spectrogram. The performance of these networks, both individually and in ensembles, is analyzed and corroborated using the Equal Error Rate and shown to surpass previous classification attempts on this dataset; the best single network achieved over 95% accuracy and the best ensembles over 98% accuracy. The contributions this study makes to the field of individual vocal classification include demonstrating that it is valuable and possible, with caution, to use transfer learning with single pretrained CNNs on the small datasets available for this problem domain. We also make a contribution to bioacoustics generally by offering a comparison of the performance of many state-of-the-art audio representations, including for the first time the LM spectrogram and Stockwell representations. All source code for this study is available on GitHub.
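
The abstract mentions combining networks into ensembles and checking results with the Equal Error Rate (EER). The following is a minimal sketch of those two ideas, not the authors' code (their implementation is on GitHub). It assumes score-level fusion by averaging (the sum rule), which the abstract does not confirm, and the function names `sum_rule_fusion`, `split_scores`, and `equal_error_rate`, as well as the toy scores, are hypothetical.

```python
import numpy as np

def sum_rule_fusion(score_list):
    """Average per-class scores from several networks (sum rule, assumed here)."""
    return np.mean(np.stack(score_list, axis=0), axis=0)

def split_scores(scores, labels):
    """Split an (n_samples, n_classes) score matrix into genuine and impostor scores."""
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels)
    mask = np.zeros(scores.shape, dtype=bool)
    mask[np.arange(len(labels)), labels] = True  # true-class entries
    return scores[mask], scores[~mask]

def equal_error_rate(genuine, impostor):
    """Approximate the EER: the operating point where the false accept rate
    and the false reject rate are closest to each other."""
    genuine = np.asarray(genuine, dtype=float)
    impostor = np.asarray(impostor, dtype=float)
    thresholds = np.unique(np.concatenate([genuine, impostor]))
    best_gap, eer = np.inf, 1.0
    for t in thresholds:
        far = np.mean(impostor >= t)  # impostor scores wrongly accepted
        frr = np.mean(genuine < t)    # genuine scores wrongly rejected
        gap = abs(far - frr)
        if gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2.0
    return eer

# Toy scores from two hypothetical networks for three roars and two lions.
net_a = np.array([[0.9, 0.1], [0.4, 0.6], [0.2, 0.8]])
net_b = np.array([[0.7, 0.3], [0.6, 0.4], [0.3, 0.7]])
labels = np.array([0, 0, 1])

fused = sum_rule_fusion([net_a, net_b])
genuine, impostor = split_scores(fused, labels)
eer = equal_error_rate(genuine, impostor)
print(f"Ensemble EER on toy data: {eer:.3f}")
```

A lower EER means genuine and impostor scores are better separated; with real data the scores would come from the retrained CNNs on held-out roars rather than from hand-written arrays.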

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Computer Graphics and Computer-Aided Design; Computer Vision and Pattern Recognition; Radiology, Nuclear Medicine and Imaging

Link

https://www.mdpi.com/2313-433X/8/4/96/pdf

References: 76 articles.

1. Acoustic communication in lions and its use in territoriality;Ramsauer;Cogn. Brain Behav.,2005

2. Roaring and numerical assessment in contests between groups of female lions, Panthera leo

3. An acoustic analysis of lion roars. I: Data collection and spectrogram and waveform analyses;Eklund,2011

4. Vocal discrimination of African lions and its potential for collar-free tracking

5. Do acoustic features of lion, Panthera leo, roars reflect sex and male condition?

Cited by 8 articles.

1. Windy events detection in big bioacoustics datasets using a pre-trained Convolutional Neural Network;Science of The Total Environment;2024-11

2. Infant cry classification using an efficient graph structure and attention-based model;Kuwait Journal of Science;2024-07

3. Knowing a fellow by their bellow: acoustic individuality in the bellows of the American alligator;Animal Behaviour;2024-01

4. Using autonomous recording units for vocal individuality: insights from Barred Owl identification;Avian Conservation and Ecology;2024

5. Calls of Manx shearwater Puffinus puffinus contain individual signatures;Journal of Avian Biology;2023-12-21