VISUAL SPEECH RECOGNITION USING DYNAMIC FEATURES AND SUPPORT VECTOR MACHINES-Reference-Cited by-同舟云学术

VISUAL SPEECH RECOGNITION USING DYNAMIC FEATURES AND SUPPORT VECTOR MACHINES

Published:2008-07 Issue:03 Volume:08 Page:419-437
ISSN:0219-4678
Container-title:International Journal of Image and Graphics
language:en
Short-container-title:Int. J. Image Grap.

Author:

YAU WAI CHEE¹,KUMAR DINESH KANT¹,ARJUNAN SRIDHAR POOSAPADI¹

Affiliation:

1. School of Electrical and Computer Engineering, RMIT University, GPO Box 2476V, Melbourne, Victoria 3001, Australia

Abstract

This paper presents a vision based technique to identify the unspoken phones using a small camera that is located on the headset of the speaker. The system is based on temporal integration of the video data to generate motion history image (MHI). The paper proposes the use of global features to classify the MHI and compares the use of image moments with Discrete Cosine Transform (DCT). A comparison between Zernike moments (ZM) with DCT indicates that while the accuracy of classification for both techniques is very comparable (96% for ZM and 94% for DCT) when there is no relative motion between the camera and the mouth, ZM is resilient to rotation of the camera and continues to gives good results despite rotation but DCT is sensitive to rotation. Based on the accuracy of the system and its resilience to movement artefacts such as rotation, the authors propose the use of such a system for human computer interface. Such a system could be invaluable when it is important to communicate without making a sound, such as giving passwords when in an open office or in public spaces.

Publisher

World Scientific Pub Co Pte Lt

Subject

Computer Graphics and Computer-Aided Design,Computer Science Applications,Computer Vision and Pattern Recognition

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0219467808003167

Reference17 articles.

1. Audiovisual speech processing

2. Hearing lips and seeing voices

3. Analysis of Lip Geometric Features for Audio-Visual Speech Recognition

4. The recognition of human movement using temporal templates

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Shuffle Attention U-Net for Speech Enhancement in Time Domain;International Journal of Image and Graphics;2023-03-31

2. Privacy-Preserving Multi-Class Support Vector Machine Model on Medical Diagnosis;IEEE Journal of Biomedical and Health Informatics;2022-07

3. Effect of Various Visual Speech Units on Language Identification Using Visual Speech Recognition;International Journal of Image and Graphics;2020-10

4. Automatic visual speech segmentation and recognition using directional motion history images and Zernike moments;The Visual Computer;2012-09-13

5. VISUAL SPEECH RECOGNITION USING OPTICAL FLOW AND SUPPORT VECTOR MACHINES;International Journal of Computational Intelligence and Applications;2011-06