ECOGEN: Bird sounds generation using deep learning

Author:

Guei Axel‐Christian (1, 2), Christin Sylvain (2), Lecomte Nicolas (2), Hervet Éric (1)

Affiliation:

1. Department of Computer Science, University of Moncton, Moncton, New Brunswick, Canada

2. Department of Biology, Canada Research Chair in Polar and Boreal Ecology and Centre d'Études Nordiques, University of Moncton, Moncton, New Brunswick, Canada

Abstract

Large‐scale acoustic projects generate vast amounts of data that can now be processed efficiently with deep learning tools. However, these tools are often limited by sound labeling and imbalanced sampling. Data augmentation can help overcome such challenges, particularly through the generation of synthetic, lifelike sounds. Synthetic samples can be valuable not only for deep learning but also for species with limited available data. Despite advances in computing power, sound generation remains time‐consuming and still requires a substantial number of samples. We present ECOGEN, a novel deep learning approach designed to generate realistic bird songs for biologists and ecologists. The primary objective of ECOGEN is to increase the number of samples in under‐represented bird song classes, thereby improving the performance and robustness of classifiers in ecological research. The ECOGEN framework represents bird songs as spectrograms and leverages proven image generation techniques to create new spectrograms, which are subsequently converted back to digital audio signals. As a class‐agnostic tool, ECOGEN is applicable to a wide range of biophonic sounds, including mammal and insect calls. We show that adding samples generated by ECOGEN to a bird song classifier improved classification accuracy by 12% on average and outperformed classic data augmentation techniques 80% of the time. Our approach is both fast and efficient, enabling the generation of synthetic bird songs on standard computing resources. By facilitating the creation of synthetic bird songs, ECOGEN can contribute to the conservation of endangered bird species, while providing valuable insights into their vocalizations, behaviours and habitat preferences. Future development of ECOGEN can be easily implemented and could focus on incorporating additional configurable parameters during the generation phase for increased control over the output, catering to the specific needs of biologists.
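The spectrogram round trip described in the abstract (audio to spectrogram, then spectrogram back to audio) can be illustrated with a minimal sketch. The abstract does not state which inversion method or which parameters ECOGEN uses, so everything below is an assumption made for illustration: the Griffin-Lim phase reconstruction algorithm stands in for the inversion step, the frame settings `N_FFT` and `HOP` are arbitrary, and a synthetic frequency sweep stands in for a recorded bird song. The generative model itself is omitted.

```python
import numpy as np

N_FFT, HOP = 512, 128  # illustrative frame settings, not taken from the paper


def stft(signal, n_fft=N_FFT, hop=HOP):
    """Short-time Fourier transform with a Hann window (frames x frequency bins)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    frames = np.stack([signal[i * hop:i * hop + n_fft] * window
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)


def istft(spec, n_fft=N_FFT, hop=HOP):
    """Weighted overlap-add inverse of `stft`."""
    window = np.hanning(n_fft)
    n_frames = spec.shape[0]
    length = n_fft + hop * (n_frames - 1)
    signal = np.zeros(length)
    norm = np.zeros(length)
    frames = np.fft.irfft(spec, n=n_fft, axis=1)
    for i in range(n_frames):
        start = i * hop
        signal[start:start + n_fft] += frames[i] * window
        norm[start:start + n_fft] += window ** 2
    # Guard against division by ~0 at the very first and last samples.
    return signal / np.maximum(norm, 1e-8)


def griffin_lim(magnitude, n_fft=N_FFT, hop=HOP, n_iter=32, seed=0):
    """Estimate a waveform from a magnitude spectrogram (Griffin-Lim iterations)."""
    rng = np.random.default_rng(seed)
    phase = np.exp(2j * np.pi * rng.random(magnitude.shape))  # random start
    for _ in range(n_iter):
        audio = istft(magnitude * phase, n_fft, hop)
        rebuilt = stft(audio, n_fft, hop)
        phase = np.exp(1j * np.angle(rebuilt))  # keep phase, reimpose magnitude
    return istft(magnitude * phase, n_fft, hop)


# Stand-in "song": a one-second upward frequency sweep sampled at 8 kHz.
sr = 8000
t = np.arange(sr) / sr
song = np.sin(2 * np.pi * (1000 * t + 500 * t ** 2))

spectrogram = np.abs(stft(song))   # image-like representation of the sound
# A generative image model would produce new spectrograms at this point.
audio = griffin_lim(spectrogram)   # convert the spectrogram back to a waveform
```

Because a magnitude spectrogram discards phase, the inversion step has to estimate it; that is why an iterative method is used here rather than a direct inverse transform. In practice a library routine such as `librosa.griffinlim` would normally replace the hand-written functions.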

Funder

Canada Foundation for Innovation

Canada Research Chairs

Canadian Network for Research and Innovation in Machining Technology, Natural Sciences and Engineering Research Council of Canada

Publisher

Wiley

Subject

Ecological Modeling; Ecology, Evolution, Behavior and Systematics

References: 55 articles.


Cited by 2 articles.