Review on Automatic Lip Reading Techniques-Reference-Cited by-同舟云学术

Review on Automatic Lip Reading Techniques

Published:2018-03-14 Issue:07 Volume:32 Page:1856007
ISSN:0218-0014
Container-title:International Journal of Pattern Recognition and Artificial Intelligence
language:en
Short-container-title:Int. J. Patt. Recogn. Artif. Intell.

Author:

Lu Yuanyao¹^ORCID,Yan Jie¹,Gu Ke¹

Affiliation:

1. School of Electronic and Information Engineering, North China University of Technology, Beijing, P. R. China

Abstract

As a significant component of the Human Computer Interface (HCI), automatic lip reading is designed for the purpose of understanding the content of speech by interpreting the movements of the lips. Although performance of automatic lip reading system is easily affected by challenging conditions such as noise, illumination and low resolution, enormous advancements in the relevant fields accompanied with enhancement in computer capability have improved the robustness of the system, making it more adaptable to the real environment. In this paper, we study the field and gives a detailed discussion on the actuality and the developing level of automatic lip reading. We emphatically introduce the feature extraction and recognition model algorithms. We also compare and analyze the various visual speech databases for their characteristics and functions in speech recognition systems. In addition, we describe the challenges and offer our insights into future research direction of automatic lip reading.

Funder

the National Natural Science Foundation of China

the Beijing Natural Science Foundation of China

the Science and Technology Development Program of Beijing Municipal Education Commission

the Great Wall Scholar Reserved Talent Program of North China University of Technology

Publisher

World Scientific Pub Co Pte Lt

Subject

Artificial Intelligence,Computer Vision and Pattern Recognition,Software

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0218001418560074

Reference42 articles.

1. Comparison of colour transforms used in lip segmentation algorithms

2. Turning a blind eye to the lexicon: ERPs show no cross-talk between lip-read and lexical context during speech sound processing

3. Statistical Inference for Probabilistic Functions of Finite State Markov Chains

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Survey on Visual Speech Recognition using Deep Learning Techniques;2023 International Conference on Communication System, Computing and IT Applications (CSCITA);2023-03-31

2. Electromyogram‐Based Lip‐Reading via Unobtrusive Dry Electrodes and Machine Learning Methods;Small;2023-01-26

3. Continuous Phoneme Recognition based on Audio-Visual Modality Fusion;2022 International Joint Conference on Neural Networks (IJCNN);2022-07-18

4. Decoding lip language using triboelectric sensors with deep learning;Nature Communications;2022-03-17

5. A Systematic Review on Physiological-Based Biometric Recognition Systems: Current and Future Trends;Archives of Computational Methods in Engineering;2021-02-23