Real Time Raspberry Pi based System for Linguistic Content Recognition from Speech

Published: 2023-08-07

Author:

A Revahi¹, N Sasikaladevi¹

Affiliation:

1. SASTRA Deemed University: Shanmugha Arts Science Technology and Research Academy

Abstract

Recognizing linguistic information from speech makes it possible to identify the language in which an utterance is spoken, and such a system could serve as a translator that meaningfully converts a sentence spoken in one language into another. A real-time implementation of language identification (LID) from speech requires the speech to be sent from a Raspberry Pi board in the transmitter section; a Raspberry Pi board in the receiver section receives it and passes it to the system that identifies the language. In the training phase, two-dimensional spectrogram features are derived from the training speech and given to a layered CNN architecture to create a template for each language. In the testing phase, speech is transmitted from the memory card of the Raspberry Pi board in the transmitter; the Raspberry Pi board in the receiver receives it and passes it to the receiver-side system. Two-dimensional spectrogram features are derived from the test speech and applied to the CNN templates, and the language of the test speech is decided on the basis of a similarity index. The system is implemented with spectrogram, Mel spectrogram and ERB spectrogram features, using a CNN for modeling and classification of languages. The validation error is 1.4%, 1.8% and 3% for the spectrogram, Mel spectrogram and ERB spectrogram based systems respectively, and a decision-level fusion classifier gives a validation error of 0.9%. The system can be implemented in hardware using Raspberry Pi boards. This automated real-time multilingual language identification system would be useful in forensic departments and defense sectors to identify persons belonging to any region or speaking any language.
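
The abstract does not state which rule the decision-level fusion classifier uses. As a minimal sketch, assuming the three CNN systems each output one score per language and that fusion simply averages those scores, the combination step could look like the following. The function name `fuse_decisions`, the placeholder language labels, and the score values are all hypothetical and are not taken from the paper.

```python
def fuse_decisions(scores_per_system, labels):
    """Average per-language scores across systems and pick the top language.

    Assumption: score averaging stands in for the paper's unspecified
    decision-level fusion rule.
    """
    n_systems = len(scores_per_system)
    fused = [
        sum(system[i] for system in scores_per_system) / n_systems
        for i in range(len(labels))
    ]
    best = max(range(len(labels)), key=lambda i: fused[i])
    return labels[best], fused


# Hypothetical scores for a single test utterance (illustrative values only).
labels = ["lang_A", "lang_B", "lang_C"]
scores = [
    [0.6, 0.3, 0.1],  # spectrogram-based CNN
    [0.4, 0.5, 0.1],  # Mel spectrogram-based CNN
    [0.5, 0.3, 0.2],  # ERB spectrogram-based CNN
]
language, fused = fuse_decisions(scores, labels)
print(language, fused)
```

In this illustration the Mel spectrogram system alone would pick `lang_B`, while the fused scores favor `lang_A`. That is the kind of disagreement a fusion stage is meant to resolve.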

Publisher

Research Square Platform LLC
