Latent space visualization, characterization, and generation of diverse vocal communication signals-Reference-Cited by-同舟云学术

Latent space visualization, characterization, and generation of diverse vocal communication signals

Published:2019-12-11 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Sainburg Tim^ORCID,Thielk Marvin^ORCID,Gentner Timothy Q^ORCID

Abstract

ABSTRACTAnimals produce vocalizations that range in complexity from a single repeated call to hundreds of unique vocal elements patterned in sequences unfolding over hours. Characterizing complex vocalizations can require considerable effort and a deep intuition about each species’ vocal behavior. Even with a great deal of experience, human characterizations of animal communication can be affected by human perceptual biases. We present here a set of computational methods that center around projecting animal vocalizations into low dimensional latent representational spaces that are directly learned from data. We apply these methods to diverse datasets from over 20 species, including humans, bats, songbirds, mice, cetaceans, and nonhuman primates, enabling high-powered comparative analyses of unbiased acoustic features in the communicative repertoires across species. Latent projections uncover complex features of data in visually intuitive and quantifiable ways. We introduce methods for analyzing vocalizations as both discrete sequences and as continuous latent variables. Each method can be used to disentangle complex spectro-temporal structure and observe long-timescale organization in communication. Finally, we show how systematic sampling from latent representational spaces of vocalizations enables comprehensive investigations of perceptual and neural representations of complex and ecologically relevant acoustic feature spaces.

Publisher

Cold Spring Harbor Laboratory

Reference111 articles.

1. Acoustic sequences in non-human animals: a tutorial review and prospectus

2. Songs to syntax: the linguistics of birdsong

3. Parallels in the sequential organization of birdsong and human speech;Nature communications,2019

4. A simple explanation for the evolution of complex song syntax in bengalese finches;Biology letters,2013

5. Long-range order in canary song;PLoS computational biology,2013

Cited by 16 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. SqueakOut: Autoencoder-based segmentation of mouse ultrasonic vocalizations;2024-04-23

2. Bidirectional Generative Adversarial Representation Learning for Natural Stimulus Synthesis;2023-10-17

3. Information Theory Opens New Dimensions in Experimental Studies of Animal Behaviour and Communication;Animals;2023-03-26

4. Deep audio embeddings for vocalisation clustering;2023-03-12

5. Extracting extended vocal units from two neighborhoods in the embedding plane;2022-09-27