[1] T. Masuko, K. Tokuda, T. Kobayashi, and S. Imai, “Speech synthesis from HMMs using dynamic features,” Proc. ICASSP, pp.389-392, 1996.
[2] T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, “Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis,” Proc. Eurospeech, pp.2347-2350, 1999.
[3] K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, “Speech parameter generation algorithms for HMM-based speech synthesis,” Proc. ICASSP, pp.1315-1318, 2000.
[4] J.-T. Chien and C.-H. Chueh, “Joint acoustic and language modeling for speech recognition,” Speech Commun., vol.52, no.3, pp.223-235, 2010.
[5] A. Parlikar, A. Black, and S. Vogel, “Improving speech synthesis of machine translation output,” Proc. Interspeech, pp.194-197, 2010.