Fast and accurate annotation of acoustic signals with deep neural networks
Author:
Steinfath Elsa12ORCID, Palacios-Muñoz Adrian12ORCID, Rottschäfer Julian R12ORCID, Yuezak Deniz12, Clemens Jan13ORCID
Affiliation:
1. European Neuroscience Institute - A Joint Initiative of the University Medical Center Göttingen and the Max-Planck-Society, Göttingen, Germany 2. International Max Planck Research School and Göttingen Graduate School for Neurosciences, Biophysics, and Molecular Biosciences (GGNB) at the University of Göttingen, Göttingen, Germany 3. Bernstein Center for Computational Neuroscience, Göttingen, Germany
Abstract
Acoustic signals serve communication within and across species throughout the animal kingdom. Studying the genetics, evolution, and neurobiology of acoustic communication requires annotating acoustic signals: segmenting and identifying individual acoustic elements like syllables or sound pulses. To be useful, annotations need to be accurate, robust to noise, and fast.We here introduce DeepAudioSegmenter (DAS), a method that annotates acoustic signals across species based on a deep-learning derived hierarchical presentation of sound. We demonstrate the accuracy, robustness, and speed of DAS using acoustic signals with diverse characteristics from insects, birds, and mammals. DAS comes with a graphical user interface for annotating song, training the network, and for generating and proofreading annotations. The method can be trained to annotate signals from new species with little manual annotation and can be combined with unsupervised methods to discover novel signal types. DAS annotates song with high throughput and low latency for experimental interventions in realtime. Overall, DAS is a universal, versatile, and accessible tool for annotating acoustic communication signals.
Funder
Deutsche Forschungsgemeinschaft European Research Council
Publisher
eLife Sciences Publications, Ltd
Subject
General Immunology and Microbiology,General Biochemistry, Genetics and Molecular Biology,General Medicine,General Neuroscience
Reference87 articles.
1. Abadi M, Barham P, Chen J, Chen Z, Davis A, Dean J, Devin M, Ghemawat S, Irving G, Isard M, Kudlur M, Levenberg J, Monga R, Moore S, Murray DG, Steiner B, Tucker P, Vasudevan V, Warden P, Wicke M, Yu Y, Zheng X. 2016. Tensorflow: A System for Large-Scale Machine Learning OSDI’16. https://www.usenix.org/system/files/conference/osdi16/osdi16-abadi.pdf. 2. Multi-channel acoustic recording and automated analysis of Drosophila courtship songs;Arthur;BMC biology,2013 3. Songexplorer: a deep learning workflow for discovery and segmentation of animal acoustic communication signals;Arthur,2021 4. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling;Bai,2018 5. Acoustic Pattern Recognition and Courtship Songs: Insights from Insects;Baker;Annual review of neuroscience,2019
Cited by
36 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|