Authors:
Moss Henry B., Aggarwal Vatsal, Prateek Nishant, Gonzalez Javier, Barra-Chicote Roberto
Cited by
42 articles.
1. MRMI-TTS: Multi-Reference Audios and Mutual Information Driven Zero-Shot Voice Cloning;ACM Transactions on Asian and Low-Resource Language Information Processing;2024-05-10
2. Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
3. Bayesian Optimization with Gaussian Processes for Robust Localization;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
4. One-shot multi-speaker text-to-speech using RawNet3 speaker representation*;Phonetics and Speech Sciences;2024-03
5. USAT: A Universal Speaker-Adaptive Text-to-Speech Approach;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024