Affiliation:
1. Institute of Communication Systems, Faculty of Electronics, Military University of Technology, 00-908 Warsaw, Poland
Abstract
Voice conversion is a process where the essence of a speaker’s identity is seamlessly transferred to another speaker, all while preserving the content of their speech. This usage is accomplished using algorithms that blend speech processing techniques, such as speech analysis, speaker classification, and vocoding. The cutting-edge voice conversion technology is characterized by deep neural networks that effectively separate a speaker’s voice from their linguistic content. This article offers a comprehensive overview of the development status of this area of science based on the current state-of-the-art voice conversion methods.
Funder
National Centre for Research and Development
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference66 articles.
1. Voice conversion;Childers;Speech Commun.,1989
2. An Overview of Voice Conversion Systems;Mohammadi;Speech Commun.,2017
3. An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning;Sisman;IEEE/ACM Trans. Audio Speech Lang. Process.,2020
4. Variani, E., Lei, X., McDermott, E., Moreno, I.L., and Gonzalez-Dominguez, J. (2014, January 4–9). Deep Neural Networks for Small Footprint Text-dependent Speaker Verification. Proceedings of the 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Florence, Italy.
5. Voice conversion versus speaker verification: An overview;Wu;APSIPA Trans. Signal Inf. Process.,2014
Cited by
12 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献