1. Ghadage, Y. H. & Shelke, S. D. Speech to Text Conversion for Multilingual Languages (2016), 236–240.
2. Morgan, N. Deep and wide: Multiple layers in automatic speech recognition. IEEE Trans. Audio, Speech Lang. Process. 20 (2012), 7–13.
3. Tian, C., Ji, W. & Yuan, Y. Auxiliary Multimodal LSTM for Audiovisual Speech Recognition and Lipreading (2017), 1–9.
4. How many people used YouTube in 2021, backlinko.com/youtube-users.
5. Simons, A. D. & Cox, S. J. Generation of mouth shapes for a synthetic talking head. Proceedings of the Institute of Acoustics, Autumn Meeting 12 (January 1990), 475–482.