1. Adetunmbi OA, Obe OO, Iyanda JN (2016) Development of standard Yoruba speech-to-text system using HTK. Int J Speech Technol 19(4):929–944
2. Ahkuputra V, Jitapunkul S, Jittiwarangkul N, Maneenoi E, Kasuriya S (1988) A comparison of Thai speech recognition systems using hidden markov model, neural network, and fuzzy-neural network. In: Proceedings of the 5th international conference on spoken language processing (ICSLP), vol 3, pp 715–717
3. Amodei D, Ananthanarayanan S, Anubhai R, Bai J, Battenberg E, Case C, Casper J, Catanzaro B, Cheng Q, Chen G, Chen J, Chen J, Chen Z, Chrazanowski M, Coates A, Diamos G, Ding K, Du N, Elsen E, Engel J, Fang W, Jiang B, Ju C, Jun B, Legresley P, Lin L, Liu J, Liu Y, Li W, Li X, Ma D, Narang S, Ng A, Ozair S, Peng Y, Prenger R, Qian S, Srinet K, Sriram A, Tang H, Tang L, Wang C, Wang J, Wang K, Wang Yi, Wang Z, Wang Z, Wu S, Wei L, Xiao B, Xie W, Xie Y, Yogatama D, Yuan B, Zhan J, Zhu Z (2016) Deep speech 2: end-to-end speech recognition in English and Mandarin. In: Proceedings of the 33rd international conference on machine learning (ICML), New York, vol 48, pp 173–182
4. Arora A, Kadyan V, Singh A (2019) Effect of tonal features on various dialectal variations of Punjabi language. In: Proceedings of the conference on advances in signal processing and communication, pp 467–475
5. Besacier L, Le VB, Boitet C, Berment V, 2006 ASR and translation for under-resourced languages. In: Proceedings of the international conference on acoustics, speech and signal processing, Toulouse, France, vol 5, pp 1221–1224