Funder
National Science Foundation
Subject
Artificial Intelligence,Computer Vision and Pattern Recognition,Signal Processing,Software
Reference26 articles.
1. Contrastive learning of general-purpose audio representations;Saeed,2021
2. J. Shor, A. Jansen, R. Maor, O. Lang, O. Tuval, F. de Chaumont Quitry, M. Tagliasacchi, I. Shavitt, D. Emanuel, Y. Haviv, Towards Learning a Universal Non-Semantic Representation of Speech, in: Proc. Interspeech 2020, 2020, pp. 140–144.
3. J. Shor, S. Venugopalan, TRILLsson: Distilled Universal Paralinguistic Speech Representations, in: Proc. Interspeech 2022, 2022, pp. 356–360.
4. Voxforge;MacLean,2018
5. CREMA-D: Crowd-sourced emotional multimodal actors dataset;Cao;IEEE Trans. Affect. Comput.,2014