Improving Few-Shot Learning for Talking Face System with TTS Data Augmentation-Reference-Cited by-同舟云学术

Improving Few-Shot Learning for Talking Face System with TTS Data Augmentation

Published:2023-06-04 Issue: Volume: Page:
ISSN:
Container-title:ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
language:
Short-container-title:

Author:

Chen Qi¹,Ma Ziyang¹,Liu Tao¹,Tan Xu²,Lu Qu³,Yu Kai¹,Chen Xie¹

Affiliation:

1. Shanghai Jiao Tong University,X-LANCE Lab,Department of Computer Science and Engineering, MoE Key Lab of Artificial Intelligence, AI Institute,China

2. Microsoft Research Asia

3. Shanghai Media Tech

Publisher

IEEE

Link

http://xplorestaging.ieee.org/ielx7/10094559/10094560/10094777.pdf?arnumber=10094777

Reference28 articles.

1. SynAug: Synthesis-Based Data Augmentation for Text-Dependent Speaker Verification

2. Speaker Augmentation for Low Resource Speech Recognition

3. Dynamic-programming approach to continuous speech recognition;sakoe;1971 Proc the International Congress of Acoustics,1971

4. Soft-dtw: a differentiable loss function for time-series;cuturi;International Conference on Machine Learning,2017

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A survey on deep learning based reenactment methods for deepfake applications;IET Image Processing;2024-08-19

2. Expressive Talking Avatars;IEEE Transactions on Visualization and Computer Graphics;2024-05

3. DiffDub: Person-Generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-Encoder;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14

4. A Speech-Driven Facial Motion Method Based on Temporal Loss;Computer Science and Application;2023