AUDD: Audio Urdu Digits Dataset for Automatic Audio Urdu Digit Recognition-Reference-Cited by-同舟云学术

AUDD: Audio Urdu Digits Dataset for Automatic Audio Urdu Digit Recognition

Published:2021-09-23 Issue:19 Volume:11 Page:8842
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Chandio Aisha,Shen Yao,Bendechache Malika^ORCID,Inayat Irum^ORCID,Kumar Teerath

Abstract

The ongoing development of audio datasets for numerous languages has spurred research activities towards designing smart speech recognition systems. A typical speech recognition system can be applied in many emerging applications, such as smartphone dialing, airline reservations, and automatic wheelchairs, among others. Urdu is a national language of Pakistan and is also widely spoken in many other South Asian countries (e.g., India, Afghanistan). Therefore, we present a comprehensive dataset of spoken Urdu digits ranging from 0 to 9. Our dataset has 25,518 sound samples that are collected from 740 participants. To test the proposed dataset, we apply different existing classification algorithms on the datasets including Support Vector Machine (SVM), Multilayer Perceptron (MLP), and flavors of the EfficientNet. These algorithms serve as a baseline. Furthermore, we propose a convolutional neural network (CNN) for audio digit classification. We conduct the experiment using these networks, and the results show that the proposed CNN is efficient and outperforms the baseline algorithms in terms of classification accuracy.

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/11/19/8842/pdf

Reference44 articles.

1. ImageNet Large Scale Visual Recognition Challenge

2. Dispersed federated learning: Vision, taxonomy, and future directions;Khan;arXiv,2020

3. Federated Learning for Internet of Things: Recent Advances, Taxonomy, and Open Challenges

4. Text Classification Algorithms: A Survey

5. Techniques for text classification: Literature review and current trends;Jindal;Webology,2015

Cited by 24 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. AI-assisted Segmentation Tool for Brain Tumor MR Image Analysis;Journal of Imaging Informatics in Medicine;2024-07-08

2. Efficient Paddy Grain Quality Assessment Approach Utilizing Affordable Sensors;AI;2024-05-14

3. Amharic spoken digits recognition using convolutional neural network;Journal of Big Data;2024-05-04

4. A Multifaceted Feature Extraction Approach for Noise-Robust Punjabi Spoken Digit Recognition System Under Low-Resource Conditions;2024 11th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO);2024-03-14

5. Brain tumor segmentation based on optimized convolutional neural network and improved chimp optimization algorithm;Computers in Biology and Medicine;2024-01