1. A Kain, MW Macon, in Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Spectral voice conversion for text-to-speech synthesis, (1998), pp. 285–288.
2. C Veaux, X Robet, in Proceedings of Interspeech. Intonation conversion from neutral to expressive speech, (2011), pp. 2765–2768.
3. K Nakamura, T Toda, H Saruwatari, K Shikano, Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech. Speech Commun. 54(1), 134–146 (2012).
4. L Deng, A Acero, L Jiang, J Droppo, X Huang, in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). High-performance robust speech recognition using stereo training data, (2001), pp. 301–304.
5. A Kunikoshi, Y Qiao, N Minematsu, K Hirose, in Proceedings of Interspeech. Speech generation from hand gestures based on space mapping, (2009), pp. 308–311.