1. Kyunghyun Cho, Bart van Merrienboer, Çaglar Gülçehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, Yoshua Bengio, Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation, in: EMNLP, 2014.
2. Yuxuan Wang, R.J. Skerry-Ryan, Daisy Stanton, Yonghui Wu, Ron J. Weiss, Navdeep Jaitly, Zongheng Yang, Ying Xiao, Zhifeng Chen, Samy Bengio, et al., Tacotron: Towards End-to-End Speech Synthesis, in: Proc. Interspeech 2017, 2017, pp. 4006–4010.
3. Yi Ren, et al., FastSpeech: Fast, Robust and Controllable Text to Speech, in: Advances in Neural Information Processing Systems, 2019.
4. Yi Ren, Chenxu Hu, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu, FastSpeech 2: Fast and High-Quality End-to-End Text to Speech, in: International Conference on Learning Representations, 2020.
5. Shiyang Li, et al., Enhancing the Locality and Breaking the Memory Bottleneck of Transformer on Time Series Forecasting, in: Advances in Neural Information Processing Systems, 2019.