EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion-Reference-Cited by-同舟云学术

EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion

Published:2021-12-13 Issue: Volume: Page:
ISSN:
Container-title:2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
language:
Short-container-title:

Author:

Tan Daxin¹,Deng Liqun²,Yeung Yu Ting²,Jiang Xin²,Chen Xiao²,Lee Tan¹

Affiliation:

1. The Chinese University of Hong Kong,Department of Electronic Engineering,Hong Kong

2. Huawei Noah's Ark Lab,Shenzhen,China

Funder

Chinese University of Hong Kong

Publisher

IEEE

Link

Reference29 articles.

1. Deep voice 3: 2000-speaker neural text-to-speech;wei;Proc ICLR,0

2. Durian: Duration informed attention network for multimodal synthesis;yu;ArXiv Preprint,2019

3. Fastspeech 2: Fast and high-quality end-to-end text-to-speech;ren;ArXiv Preprint,2020

5. Glow-tts: A generative flow for text-to-speech via monotonic alignment search;kim;Advances in neural information processing systems,2020

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. uSee: Unified Speech Enhancement And Editing with Conditional Diffusion Models;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14

4. DiffVoice: Text-to-Speech with Latent Diffusion;ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2023-06-04