Bangla User Adaptive Word Speech Recognition

Author:

Firoze Adnan1,Arifin Md Shamsul1,Rahman Rashedur M.1

Affiliation:

1. Department of Electrical Engineering and Computer Science, North South University, Dhaka, Bangladesh

Abstract

The paper presents Bangla word speech recognition using two novel approaches with a comprehensive analysis. The first approach is based on spectral analysis and fuzzy logic and the second one uses Mel-Frequency Cepstral Coefficients (MFCC) analysis and feed-forward back-propagation neural networks. As human speech is imprecise and ambiguous, fuzzy logic – the base of which is indeed linguistic ambiguity, could serve as a precise tool for analyzing and recognizing human speech. The authors’ systems revolve around the visual representations of voiced signals – the Fourier energy spectrum and the MFCC. The essences of a Fourier energy spectrum and the MFCC are matrices that include information about properties of a sound by storing energy and frequency in discrete time. The decision making process of their systems is based on fuzzy logic and neural networks. Experimental results demonstrate that their fuzzy logic based system is 86% accurate whereas the Artificial Neural Networks (ANN) based system is 90% accurate compared to a commercial Hidden Markov Model (HMM) based speech recognizer that shows 73% accuracy on an average. Moreover, the authors’ research derives that, even though ANN gives a better recognition accuracy than the fuzzy logic based system, the fuzzy logic based system is more accurate when it comes to “more difficult” or “polysyllabic” words. In terms of runtime performance, the fuzzy logic based system outperforms the ANN based Bangla speech recognition system.

Publisher

IGI Global

Subject

General Computer Science

Reference20 articles.

1. A PCA-FBPN Approach for Job Cycle Time Estimation in a Wafer Fabrication Factory

2. Automatic Recognition of Spoken Digits

3. The nature of speech and its interpretation

4. Hasan, M. R., Nath, B., & Alauddin, B. M. (2003). Bengali phoneme recognition: A new approach. In Proceedings of the 6th International Conference on Computing and Information Technology (ICCIT) Conference, Dhaka.

5. Hasnat, M. A., Jabir, M., & Mumit, K. (2007). Isolated and continuous Bangla speech recognition: Implementation, performance and application perspective. In Proceedings of the International Symposium on Natural Language Processing (SNLP) 07, Kasetsart University, Bangok, Thailand.

Cited by 9 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Bengali Speech Recognition: An Overview;2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET);2022-09-13

2. Isolated Odia Digit Recognition Using HTK: An Implementation View;2018 2nd International Conference on Data Science and Business Analytics (ICDSBA);2018-09

3. INLP-BPN approach for recommending hotels to a mobile traveler;Journal of Ambient Intelligence and Humanized Computing;2016-09-07

4. Analyzing and forecasting the global CO2 concentration – a collaborative fuzzy–neural agent network approach;Journal of Applied Research and Technology;2015-06

5. An Efficient and Effective Fuzzy Collaborative Intelligence Approach for Cycle Time Estimation in Wafer Fabrication;International Journal of Intelligent Systems;2015-04-10

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3