LimitAccess: on-device TinyML based robust speech recognition and age classification

Author:

Maayah Marina,Abunada Ahlam,Al-Janahi Khawla,Ahmed Muhammad Ejaz,Qadir Junaid

Abstract

AbstractAutomakers from Honda to Lamborghini are incorporating voice interaction technology into their vehicles to improve the user experience and offer value-added services. Speech recognition systems are a key component of smart cars, enhancing convenience and safety for drivers and passengers. In the future, safety-critical features may rely on speech recognition, but this raises concerns about children accessing such services. To address this issue, the LimitAccess system is proposed, which uses TinyML for age classification and helps parents limit children’s access to critical speech recognition services. This study employs a lite convolutional neural network (CNN) model for two different reasons: First, CNN showed superior accuracy compared to other audio classification models for age classification problems. Second, the lite model will be integrated into a microcontroller to meet its limited resource requirements. To train and evaluate our model, we created a dataset that included child and adult voices of the keyword “open”. The system approach categorizes voices into age groups (child, adult) and then utilizes that categorization to grant access to a car. The robustness of the model was enhanced by adding a new class (recordings) to the dataset, which enabled our system to detect replay and synthetic voice attacks. If an adult voice is detected, access to start the car will be granted. However, if a child’s voice or a recording is detected, the system will display a warning message that educates the child about the dangers and consequences of the improper use of a car. Arduino Nano 33 BLE sensing was our embedded device of choice for integrating our trained, optimized model. Our system achieved an overall F1 score of 87.7% and 85.89% accuracy. LimitAccess detected replay and synthetic voice attacks with an 88% F1 score.

Funder

Qatar University

Publisher

Springer Science and Business Media LLC

Reference56 articles.

1. Cheng P, Roedig U. Personal voice assistant security and privacy—a survey. Proc IEEE. 2022;110(4):476–507.

2. Von Spiczak J, Samset E, Kacher D, Burghart C, Jolesz F, DiMaio S. A voice command interface for real-time interventional MR imaging. Proc ISMRM. 2006.

3. Katangle S, Kharade M, Deosarkar S, Kale GM, Nalbalwar S. Smart home automation-cum agriculture system. In 2020 International Conference on Industry 4.0 Technology (I4Tech), IEEE; 2020. pp. 121–5.

4. Devi SA, Ram MS, Ranganarayana K, Rao DB, Rachapudi V. Smart home system using voice command with integration of esp8266. In 2022 International Conference on Applied Artificial Intelligence and Computing (ICAAIC), IEEE; 2022. pp. 1535–9.

5. Reimer B, Mehler B, Dobres J, Coughlin J. The effects of a production level “voice-command” interface on driver behavior: summary findings on reported workload, physiology, visual attention, and driving performance. Assessing the demands of voice based in-vehicle interfaces. 2013.

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Salv AIoT Platform for Mountain Accidents Prevention and Search and Rescue Missions;2024 IEEE International Conference on Communications Workshops (ICC Workshops);2024-06-09

2. Sound Source Localization and Classification for Emergency Vehicle Siren Detection Using Resource Constrained Systems;2024 34th International Conference Radioelektronika (RADIOELEKTRONIKA);2024-04-17

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3