1. Atrey, P. K., Hossain, M. A., Saddik, A. E., & Kankanhalli, M. S. (2010). Multimodal Fusion for Multimedia Analysis: A Survey. Multimedia Systems, 16(6), 345–379.
2. Auburn, R., Baggia, P., & Scott, M. (2011, July). VoiceBrowser CallControl: CCXML Version 1.0, W3C Recommendation. http://www.w3.org/TR/2011/REC-ccxml-20110705/
3. Baggia, P., Burnett, D. C., Carter, J., Dahl, D. A., McCobb, G., & Raggett, D. (2009, February). EMMA: Extensible MultiModal Annotation Markup language, W3C Recommendation. http://www.w3.org/TR/2009/REC-emma-20090210/
4. Barnett, J., Akolkar, R., Auburn, R., Bodell, M., Burnett, D. C., Carter, J., . . . Rosenthal, N. (2014, May). State Chart XML (SCXML): State Machine Notation for Control Abstraction (W3C Working Draft). W3C. (http://www.w3.org/TR/2014/WD-scxml-20140529/)