Computational intelligence in processing of speech acoustics: a survey

Published:2022-02-17 Issue:3 Volume:8 Page:2623-2661
ISSN:2199-4536
Container-title:Complex & Intelligent Systems
language:en
Short-container-title:Complex Intell. Syst.

Author:

Singh Amitoj, Kaur Navkiran, Kukreja Vinay, Kadyan Virender, Kumar Munish

Abstract

Speech recognition is a key area in the field of pattern recognition. This paper presents a comprehensive survey of speech recognition techniques for non-Indian and Indian languages and compiles some of the computational models used for processing speech acoustics. A large number of frameworks are available for speech processing and recognition for languages spoken around the globe. However, only a limited number of automatic speech recognition (ASR) systems are available for commercial use, and technical support exists for very few of the languages spoken worldwide. This paper examines the major challenges of speech recognition for different languages. Analysis of the literature shows that the lack of standard databases for minority languages hinders speech recognition research across the globe. Compared with non-Indian languages, research on speech recognition of Indian languages (except Hindi) has not yet achieved the expected milestones. The combination of MFCC and a DNN–HMM classifier is the most commonly used system for developing ASR for minority languages, whereas for some majority languages researchers are using much more advanced DNN algorithms. It has also been observed that research in this field is quite thin and that more needs to be carried out, particularly for minority languages.

Publisher

Springer Science and Business Media LLC

Subject

Computational Mathematics,Engineering (miscellaneous),Information Systems,Artificial Intelligence

Link

https://link.springer.com/content/pdf/10.1007/s40747-022-00665-1.pdf

References: 366 articles.

1. Abdel-Hamid O, Mohamed AR, Jiang H, Deng L, Penn G, Yu D (2014) Convolutional neural networks for speech recognition. IEEE/ACM Trans Audio Speech Lang Process 22(10):1533–1545

2. Adda J, Dustmann C, Stevens K (2017) The career costs of children. J Polit Econ 125(2):293–337

3. Adda-Decker M, Lamel L, Adda G, Lavergne T (2011) A first LVCSR system for Luxembourgish, a low-resourced European language. Lang Technol Conf 2011:479–490

4. Adda-Decker M, Boula de Mareuil P, Adda G, Lamel L (2005) Investigating syllabic structures and their variation in spontaneous French. Speech Commun 46:119–139

5. Afify M, Sarikaya R, Kuo HKJ, Besacier L, Gao Y (2006) On the use of morphological analysis for dialectal Arabic speech recognition. In: Ninth international conference on spoken language processing, pp 270–280

Cited by 5 articles.

1. A survey on preprocessing and classification techniques for acoustic scene;Expert Systems with Applications;2023-11

2. Advancing Music Genre Identification Through Deep Learning Techniques;2023 International Conference on Self Sustainable Artificial Intelligence Systems (ICSSAS);2023-10-18

3. Mobile robot: automatic speech recognition application for automation and STEM education;Soft Computing;2023-02-09

4. Annotation Projection-based Dependency Parser Development for Nepali;ACM Transactions on Asian and Low-Resource Language Information Processing;2022-12-27

5. Prosody features based low resource Punjabi children ASR and T-NT classifier using data augmentation;Multimedia Tools and Applications;2022-07-20