Design and implementation of smart voice assistant and recognizing academic words

Author:

Abougarair Ahmed J,Aburakhis Mohamed KI,Zaroug Mohamed O

Abstract

This paper approaches the use of a Virtual Assistant using neural networks for recognition of commonly used words. The main purpose is to facilitate the users’ daily lives by sensing the voice and interpreting it into action. Alice, which is the name of the assistant, is implemented based on four main techniques: Hot word detection, Voice to Text conversion, Intent recognition, and Text to Voice conversion. Linux is the operating system of choice, for developing and running the assistant because it is in the public domain, also, Linux has been implemented on most Single-board computers. Python is chosen as a development language due to its capabilities and compatibility with various APIs and libraries, which are deemed necessary for the project. The virtual assistant will be required to communicate with IoT devices. In addition, a speech recognition system is created in order to recognize the significant technical words. An artificial neural network (ANN) with different structure networks and training algorithms is utilized in conjunction with the Mel Frequency Cepstral Coefficient (MFCC) feature extraction technique to increase the identification rate effectively and find the optimal performance. For training purposes, the Levenberg-Marquardt (LM) and BGFS Quasi-Newton Resilient Backpropagation are compared using 10 MFCC, utilizing from 10 to 50 neurons increasing in increments of 10 similarly for 13MFCC the training is done utilizing from between 10 to 50 neurons.

Publisher

MedCrave Group Kft.

Subject

Industrial and Manufacturing Engineering

Reference20 articles.

1. Conversational UI - A paradigm shift in business communication. Maruti Techlabs. 2017.

2. Conejos F. Conversational Interfaces: The Guide (2020) Landbot.io. Landbot.io. 2019.

3. Chatbot: What is a Chatbot? Why are Chatbots Important? Expert System. 2020.

4. Chatterjee S, Mandal R, Chakraborty M. A Comparative Analysis of Several Back Propagation Algorithms in Wireless Channel for ANN-Based Mobile Radio Signal Detector. BibSonomy. 2013.

5. Trivedi Vaibhavi, Chetan Singadiya. Isolated Word Speech Recognition Techniques and Algorithms. IJSRD - International Journal for Scientific Research & Development. 2013;1(2):2321-0613.

Cited by 14 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Modelling and Control of Sphere and Cylinder System;2024 IEEE 4th International Maghreb Meeting of the Conference on Sciences and Techniques of Automatic Control and Computer Engineering (MI-STA);2024-05-19

2. Calibrated SVM for Probabilistic Classification of In-Vehicle Voices into Vehicle Commands via Voice-to-Text LLM Transformation;2024 8th International Conference on Smart Cities, Internet of Things and Applications (SCIoT);2024-05-14

3. COLLEGEBOT: Virtual Assistant System for Enquiry Using Natural Language Processing;2024 2nd International Conference on Intelligent Data Communication Technologies and Internet of Things (IDCIoT);2024-01-04

4. Contactless mouse and voice system using ML and AL integration;i-manager's Journal on Computer Science;2024

5. Building a Secure and Transparent Online Conversation Platform;AI and Blockchain Applications in Industrial Robotics;2023-12-29

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3