Abstract
This paper draws inspiration from image style transfer model - neural style transfer, which leads to the research topic of speech style transfer based on neural network. First, the article describes the extraction process on 2D spectrogram of speech signal. Then, the speech style transfer based on convolutional neural network is constructed.
Publisher
Darcy & Roy Press Co. Ltd.
Reference12 articles.
1. Childers D G, Wu K, Hicks D M, et al. Voice conversion[J]. Speech Communication, 1989, 8(2):147-158.
2. Byron D K, Pikovsky A, Woods E. Text-to-speech for digital literature, US9183831[P]. 2015.
3. Schwardt L. C., Du Preez J. A., Voice conversion based on static speaker Characteristics IEEE COMSIG-98, Cape Town, September 1998, 57~62.
4. Qi Yingyong, Weinbery B. Bi Ning. Enhancement of female esophageal and tracheoesophageal speech, J. Acoust. Soc. Am., Nov. 1998, (5): 2461~2465.
5. Sundermann D., Ney H., Hoge H., VTLN-based cross-language voice conversion. In IEEE Automatic Speech Recognition and Understanding Workshop, 2003, 676~681.