Voice Filter: Few-Shot Text-to-Speech Speaker Adaptation Using Voice Conversion as a Post-Processing Module-Reference-Cited by-同舟云学术

Voice Filter: Few-Shot Text-to-Speech Speaker Adaptation Using Voice Conversion as a Post-Processing Module

Published:2022-05-23 Issue: Volume: Page:
ISSN:
Container-title:ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
language:
Short-container-title:

Author:

Gabrys Adam¹,Huybrechts Goeric¹,Ribeiro Manuel Sam¹,Chien Chung-Ming²,Roth Julian¹,Comini Giulia¹,Barra-Chicote Roberto¹,Perz Bartek¹,Lorenzo-Trueba Jaime¹

Affiliation:

1. Alexa AI

2. National Taiwan University (NTU)

Publisher

IEEE

Link

http://xplorestaging.ieee.org/ielx7/9745891/9746004/09747239.pdf?arnumber=9747239

Reference33 articles.

1. CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech

2. wav2vec 2.0: A framework for self-supervised learning of speech representations;baevski;Advances in neural information processing systems,2020

3. fairseq: A Fast, Extensible Toolkit for Sequence Modeling

4. High fidelity speech synthesis with adversarial networks;bi?kowski;International Conference on Learning Representations,2020

5. Neural discrete representation learning;van den oord;Advances in neural information processing systems,2017

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Speaker Adaptation For Enhancement Of Bone-Conducted Speech;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14

2. Robust Speaker Personalisation Using Generalized Low-Rank Adaptation for Automatic Speech Recognition;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14

3. Hvqu$^{2}$-Vc: A One Shot Voice Conversion by Integrating Hierarchical Vector Quantization and Nested U-Net Structure;2024

4. A review of deep learning techniques for speech processing;Information Fusion;2023-11

5. The Effect of Human Prosody on Comprehension of TTS Robot Speech;2023 32nd IEEE International Conference on Robot and Human Interactive Communication (RO-MAN);2023-08-28