1. [Adiwardana 20] Adiwardana, D., Luong, M.-T., So, D. R., Hall, J., Fiedel, N., Thoppilan, R., Yang, Z., Kulshreshtha, A., Nemade, G., Lu, Y., and Le, Q. V.: Towards a human-like open-domain chatbot, CoRR, Vol. abs/2001.09977 (2020)
2. [Akuzawa 18] Akuzawa, K., Iwasawa, Y., and Matsuo, Y.: Expressive speech synthesis via modeling expressions with variational autoencoder, in Proc. Interspeech, pp. 3067–3071 (2018)
3. [Cai 21] Cai, X., Dai, D., Wu, Z., Li, X., Li, J., and Meng, H.: Emotion controllable speech synthesis using emotion-unlabeled dataset with the assistance of cross-domain speech emotion recognition, in Proc. ICASSP, pp. 5734–5738 (2021)
4. [Church 90] Church, K. W. and Hanks, P.: Word association norms, mutual information, and lexicography, Computational Linguistics, Vol. 16, No. 1, pp. 22–29 (1990)
5. [Cui 21] Cui, C., Ren, Y., Liu, J., Chen, F., Huang, R., Lei, M., and Zhao, Z.: EMOVIE: A Mandarin emotion speech dataset with a simple emotional text-to-speech model, in Proc. Interspeech, pp. 2766–2770 (2021)