1. Y. Wang, R. Skerry-Ryan, D. Stanton, Y. Wu, R. J. Weiss, N. Jaitly, Z. Yang, Y. Xiao, Z. Chen, S. Bengio, Q. Le, Y. Agiomyrgiannakis, R. Clark, R. A. Saurous, in Interspeech 2017: 20-24 August 2017
2. Stockholm. Tacotron: Towards end-to-end speech synthesis (ISCA, 2017), pp. 4006-4010.
3. J. Shen, R. Pang, R. J. Weiss, M. Schuster, N. Jaitly, Z. Yang, Z. Chen, Y. Zhang, Y. Wang, R. Skerry-Ryan, R. A. Saurous, Y. Agiomyrgiannakis, Y. Wu, in 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP): 15-20 April 2018
4. Canada. Natural tts synthesis by conditioning wavenet on mel spectrogram predictions (IEEE, 2018), pp. 4779-4783.
5. W. Ping, K. Peng, A. Gibiansky, S. O. Arik, A. Kannan, S. Narang, J. Raiman, J. Miller, in 6th International Conference on Learning Representations (ICLR): April 30-May 3, 2018