Affiliation:
1. Mangalore Institute of Technology and Engineering, Karnataka, India
Abstract
Automatic speech recognition (ASR) has gained wide popularity in last decade. Various devices like mobile phones, computers, vehicles, and audio/video players are now being equipped with ASR technology. The increasing use and dependence on ASR technology leads to research enhancements and opportunities in this domain. This chapter provides a detailed review of various advancements in ASR systems development. It highlights history of speech recognition followed by detailed insight into recent advancements and industry leaders providing latest solutions. ASR framework has been discussed in detail which includes feature extraction techniques, acoustic modeling techniques, and language modeling techniques. The chapter also lists various popular data sets available and discusses generation of new data sets. This work will be helpful for the researchers who are new to this field and are exploring development of new speech recognition techniques.
Reference74 articles.
1. Alexa Voice Service Overview(v20160207) | Alexa Voice Service. (n.d.). Retrieved May 15, 2022, from https://developer.amazon.com/docs/alexa-voice-service/api-overview.html
2. A new model of LPC excitation for producing natural-sounding speech at low bit rates
3. An IBM PC based large-vocabulary isolated-utterance speech recognizer
4. A Maximum Likelihood Approach to Continuous Speech Recognition
5. Baker, J. (1973). Machine-Aided Labeling of Connected Speech. In Working Papers in Speech Recognition--II. Computer Science Department, Carnegie Mellon University. Retrieved from https://apps.dtic.mil/docs/citations/AD0770633