Audio-visual speech recognition using deep learning

Published: 2014-12-20 Issue: 4 Volume: 42 Page: 722-737
ISSN: 0924-669X
Container-title: Applied Intelligence
Language: en
Short-container-title: Appl Intell

Author:

Noda Kuniaki,Yamaguchi Yuki,Nakadai Kazuhiro,Okuno Hiroshi G.,Ogata Tetsuya

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence

Link

http://link.springer.com/content/pdf/10.1007/s10489-014-0629-7

References: 52 articles.

1. Abdel-Hamid O, Jiang H (2013) Rapid and effective speaker adaptation of convolutional neural network based models for speech recognition. In: Proceedings of the 14th Annual Conference of the International Speech Communication Association, Lyon, France

2. Abdel-Hamid O, Mohamed A, Jiang H, Penn G (2012) Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Kyoto, pp 4277–4280

3. Aleksic PS, Katsaggelos AK (2004) Comparison of low- and high-level visual features for audio-visual continuous automatic speech recognition. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, vol 5, Montreal, pp 917–920

4. Barker J, Berthommier F (1999) Evidence of correlation between acoustic and visual features of speech. In: Proceedings of the 14th International Congress of Phonetic Sciences, San Francisco, pp 5–9

5. Bengio Y (2009) Learning deep architectures for AI. Found Trends Mach Learn 2(1):1–127

Cited by 399 articles.

1. An efficient multi-modal sensors feature fusion approach for handwritten characters recognition using Shapley values and deep autoencoder;Engineering Applications of Artificial Intelligence;2024-12

2. Automatic lip-reading classification using deep learning approaches and optimized quaternion meixner moments by GWO algorithm;Knowledge-Based Systems;2024-11

3. AI-based visual speech recognition towards realistic avatars and lip-reading applications in the metaverse;Applied Soft Computing;2024-10

4. Audio–visual speech recognition based on regulated transformer and spatio–temporal fusion strategy for driver assistive systems;Expert Systems with Applications;2024-10

5. TextJuggler: Fooling text classification tasks by generating high-quality adversarial examples;Knowledge-Based Systems;2024-09