1. Mingjian Chen, Xu Tan, Bohan Li, Yanqing Liu, Tao Qin, Sheng Zhao, and Tie-Yan Liu. 2021. AdaSpeech: Adaptive text to speech for custom voice. arXiv preprint arXiv:2103.00993 (2021).
2. Nanxin Chen, Yu Zhang, Heiga Zen, Ron J Weiss, Mohammad Norouzi, and William Chan. 2020. WaveGrad: Estimating Gradients for Waveform Generation. In Proc. of ICLR.
3. Antonia Creswell, Tom White, Vincent Dumoulin, Kai Arulkumaran, Biswa Sengupta, and Anil A Bharath. 2018. Generative adversarial networks: An overview. IEEE Signal Processing Magazine (2018).
4. Chenye Cui, Yi Ren, Jinglin Liu, Feiyang Chen, Rongjie Huang, Ming Lei, and Zhou Zhao. 2021. EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model. arXiv preprint arXiv:2106.09317 (2021).
5. Prafulla Dhariwal and Alex Nichol. 2021. Diffusion models beat GANs on image synthesis. arXiv preprint arXiv:2105.05233 (2021).