1. Ishi, C., C. Liu, H. Ishiguro, and N. Hagita. 2012. Evaluation of formant-based lip motion generation in tele-operated humanoid robots. In 2012 IEEE/RSJ international conference on intelligent robots and systems (IROS 2012), 2377–2382.
2. Cohen, M., and D. Massaro. 1993. Modeling coarticulation in synthetic visual speech. In Models and techniques in computer animation.
3. Tamura, M., S. Kondo, T. Masuko, and T. Kobayashi. 1998. Text-to-visual speech synthesis based on parameter generation from HMM. In Proceedings of ICASSP98, 3745–3748.
4. Hong, P., Z. Wen, and T. Huang. 2002. Real-time speech-driven face animation with expressions using neural networks. IEEE Transactions on Neural Networks 13 (4): 916–927.
5. Beskow, J., and M. Nordenberg. 2005. Data-driven synthesis of expressive visual speech using an MPEG-4 talking head. In Proceedings of Interspeech 2005, 793–796.