Real Time Speech Recognition based on PWP Thresholding and MFCC using SVM-Reference-Cited by-同舟云学术

Real Time Speech Recognition based on PWP Thresholding and MFCC using SVM

Published:2020-10-26 Issue:5 Volume:10 Page:6204-6208
ISSN:1792-8036
Container-title:Engineering, Technology & Applied Science Research
language:
Short-container-title:Eng. Technol. Appl. Sci. Res.

Author:

Helali W.,Hajaiej Ζ.,Cherif A.

Abstract

The real-time performance of Automatic Speech Recognition (ASR) is a big challenge and needs high computing capability and exhaustive memory consumption. Getting a robust performance against inevitable various difficult situations such as speaker variations, accents, and noise is a tedious task. It’s crucial to expand new and efficient approaches for speech signal extraction features and pre-processing. In order to fix the high dependency issue related to processing succeeding steps in ARS and enhance the extracted features’ quality, noise robustness can be solved within the ARS extraction block feature, removing implicitly the need for further additional specific compensation parameters or data collection. This paper proposes a new robust acoustic extraction approach development based on a hybrid technique consisting of Perceptual Wavelet Packet (PWP) and Mel Frequency Cepstral Coefficients (MFCCs). The proposed system was implemented on a Rasberry Pi board and its performance was checked in a clean environment, reaching 99% average accuracy. The recognition rate was improved (from 80% to 99%) for the majority of Signal-to-Noise Ratios (SNRs) under real noisy conditions for positive SNRs and considerably improved results especially for negative SNRs.

Publisher

Engineering, Technology & Applied Science Research

Reference31 articles.

1. [1] D. Karaboga and E. Kaya, "Adaptive network based fuzzy inference system (ANFIS) training approaches: a comprehensive survey," Artificial Intelligence Review, vol. 52, no. 4, pp. 2263-2293, Dec. 2019.

2. [2] H. A. Yanco, A. Norton, W. Ober, D. Shane, A. Skinner, and J. Vice, "Analysis of Human-robot Interaction at the DARPA Robotics Challenge Trials," Journal of Field Robotics, vol. 32, no. 3, pp. 420-444, May 2015.

3. [3] A. Pereira, C. Oertel, L. Fermoselle, J. Mendelson, and J. Gustafson, "Responsive Joint Attention in Human-Robot Interaction," in 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Nov. 2019, pp. 1080-1087.

4. [4] I. Tiddi, E. Bastianelli, E. Daga, M. d'Aquin, and E. Motta, "Robot-City Interaction: Mapping the Research Landscape-A Survey of the Interactions Between Robots and Modern Cities," International Journal of Social Robotics, vol. 12, no. 2, pp. 299-324, May 2020.

5. [5] Y. Zheng, Y. Liu, and J. H. L. Hansen, "Navigation-orientated natural spoken language understanding for intelligent vehicle dialogue," in 2017 IEEE Intelligent Vehicles Symposium (IV), Jun. 2017, pp. 559-564.

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An automated system to distinguish between Corona and Viral Pneumonia chest diseases based on image processing techniques;Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization;2023-09-30

2. A Novel Approach on Speaker Gender Identification and Verification Using DWT First Level Energy and Zero Crossing;Engineering, Technology & Applied Science Research;2022-12-15

3. Efficient multimodal cancelable biometric system based on steganography and cryptography;Iran Journal of Computer Science;2022-12-03

4. Environmental Noise Reduction based on Deep Denoising Autoencoder;Engineering, Technology & Applied Science Research;2022-12-01

5. Spectral Features based Robust Speech Spoofing Detection System;2022 International Conference on Advances in Computing, Communication and Materials (ICACCM);2022-11-10