Development of Output Correction Methodology for Long Short Term Memory-Based Speech Recognition-Reference-Cited by-同舟云学术

Development of Output Correction Methodology for Long Short Term Memory-Based Speech Recognition

Published:2019-08-06 Issue:15 Volume:11 Page:4250
ISSN:2071-1050
Container-title:Sustainability
language:en
Short-container-title:Sustainability

Author:

Arslan Recep Sinan^ORCID,Barışçı Necaattin

Abstract

This paper presents a correction methodology for Long Short Term Memory (LSTM) based speech recognition. A strategy that validates with a reference database was developed for LSTM. It is conceptually simple but requires a large keyword database to match test templates. The correction method is based on the “most matching method” that is finding the word in which the system output is closest among the “Referenced Template Database”. Each LSTM model recognition output was corrected with the proposed new concept. Thus, system recognition performance was improved by correcting faulty outputs. The effectiveness, efficiency, and contribution of this approach to system performance were demonstrated by experiments. Tests carried out using different speech-text datasets and LSTM models yielded an average performance increase of 2.25%. With some advanced models, this ratio rises to 3.84%.

Publisher

MDPI AG

Subject

Management, Monitoring, Policy and Law,Renewable Energy, Sustainability and the Environment,Geography, Planning and Development

Link

https://www.mdpi.com/2071-1050/11/15/4250/pdf

Reference65 articles.

1. Daily Human Activity Recognition Using Depth Silhouettes and R Transformation for Smart Home;Jalal,2011

2. Advancements of Image Processing and Vision in Healthcare

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Exhaustive Study into Machine Learning and Deep Learning Methods for Multilingual Cyberbullying Detection in Bangla and Chittagonian Texts;Electronics;2024-04-26

2. Singer identification model using data augmentation and enhanced feature conversion with hybrid feature vector and machine learning;EURASIP Journal on Audio, Speech, and Music Processing;2024-02-26

3. Brain tumor recognition from multimodal magnetic resonance images using wavelet texture features and optimized artificial neural network;Multimedia Tools and Applications;2024-02-10

4. English Speech Emotion Classification Based on Multi-Objective Differential Evolution;Applied Sciences;2023-11-13

5. Chaotic time series prediction of nonlinear systems based on various neural network models;Chaos, Solitons & Fractals;2023-10