話し方種別情報を含むテキスト対話を活用した表現豊かなテキスト音声合成-Reference-Cited by-同舟云学术

話し方種別情報を含むテキスト対話を活用した表現豊かなテキスト音声合成

Published:2023-05-01 Issue:3 Volume:38 Page:F-MA7_1-12
ISSN:1346-0714
Container-title:Transactions of the Japanese Society for Artificial Intelligence
language:en
Short-container-title:Transactions of the Japanese Society for Artificial Intelligence

Author:

Homma Yukinori¹,Kanagawa Hiroki¹,Kobayashi Nozomi¹,Ijima Yusuke¹,Saito Kuniko¹

Affiliation:

1. NTT Human Information Laboratories, NTT Corporation

Publisher

Japanese Society for Artificial Intelligence

Subject

Artificial Intelligence,Software

Link

https://www.jstage.jst.go.jp/article/tjsai/38/3/38_38-3_F-MA7/_pdf

Reference41 articles.

1. [Adiwardana 20] Adiwardana, D., Luong, M.-T., So, D. R., Hall, J., Fiedel, N., Thoppilan, R., Yang, Z., Kulshreshtha, A., Nemade, G., Lu, Y., and Le, Q. V.: Towards a human-like open-domain chatbot, CoRR (2020)

2. [Akuzawa 18] Akuzawa, K., Iwasawa, Y., and Matsuo, Y.: Expressive speech synthesis via modeling expressions with variational autoencoder, in Proc. Interspeech, pp. 3067–3071 (2018)

3. [Cai 21] Cai, X., Dai, D., Wu, Z., Li, X., Li, J., and Meng, H.: Emotion controllable speech synthesis using emotion-unlabeled dataset with the assistance of cross-domain speech emotion recognition, in Proc. ICASSP, pp. 5734–5738 (2021)

4. [Church 90] Church, K. W. and Hanks, P.: Word association norms, mutual information, and lexicography, Computational Linguistics, Vol. 16, No. 1, pp. 22–29 (1990)

5. [Cui 21] Cui, C., Ren, Y., Liu, J., Chen, F., Huang, R., Lei, M., and Zhao, Z.: EMOVIE: A mandarin emotion speech dataset with a simple emotional text-to-speech model, in Proc. Interspeech, pp. 2766–2770 (2021)