Recognizing Five Major Dialects in Indonesia Based on MFCC and DRNN

Published:2021-03-01 Issue:1 Volume:1844 Page:012003
ISSN:1742-6588
Container-title:Journal of Physics: Conference Series
Short-container-title:J. Phys.: Conf. Ser.

Author:

Tawaqal B, Suyanto S

Abstract

A dialect is a variation of a language used by a group of people, sometimes in a particular region. It plays an essential role in automatic speech recognition (ASR). In general, an ASR system gives high accuracy in a dialect-specific case but low accuracy in multi-dialect applications, such as for the Indonesian language, which has hundreds of dialects. In this research, a system to recognize various dialects in Indonesia is developed. First, an utterance is preprocessed using both normalization and framing. Second, its features are extracted using Mel frequency cepstrum coefficients (MFCC), one of the best feature extraction methods for acoustic signals. Finally, a deep recurrent neural network (DRNN) is used to learn and classify dialect characteristics. Evaluation on a dataset of five major dialects in Indonesia shows that the accuracy produced by the DRNN generally increases with the Epoch and Batch Size, although it is not directly proportional to the value of either parameter. An Epoch of 30 and a Batch Size of 30 are the optimum parameters, yielding the highest accuracy of 87.0% on the training set. Evaluation on the testing set shows an accuracy of 85.4% for the unseen dialects.
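
The preprocessing step named in the abstract (normalization followed by framing) can be illustrated with a minimal sketch. The abstract does not state which normalization is used, nor the frame length, hop size, or window, so peak normalization, 25 ms frames, a 10 ms hop, a Hamming window, and all function names below are assumptions chosen for illustration, not the authors' settings. The MFCC and DRNN stages are not shown.

```python
import math

def peak_normalize(signal):
    """Scale samples so the largest absolute value is 1.0 (assumed normalization)."""
    peak = max((abs(s) for s in signal), default=0.0)
    if peak == 0.0:
        return [0.0 for _ in signal]
    return [s / peak for s in signal]

def frame_signal(signal, frame_len, hop_len):
    """Split a signal into overlapping frames, dropping any incomplete tail."""
    if frame_len <= 0 or hop_len <= 0:
        raise ValueError("frame_len and hop_len must be positive")
    return [signal[start:start + frame_len]
            for start in range(0, len(signal) - frame_len + 1, hop_len)]

def hamming(frame):
    """Apply a Hamming window to one frame (assumed window choice)."""
    n = len(frame)
    if n == 1:
        return list(frame)
    return [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
            for i, x in enumerate(frame)]

# Synthetic stand-in for an utterance: 1 s of a 440 Hz tone at 16 kHz.
sample_rate = 16000
tone = [0.3 * math.sin(2 * math.pi * 440 * t / sample_rate)
        for t in range(sample_rate)]

normalized = peak_normalize(tone)
# 25 ms frames (400 samples) with a 10 ms hop (160 samples); assumed values.
frames = frame_signal(normalized, frame_len=400, hop_len=160)
windowed = [hamming(f) for f in frames]
```

Each windowed frame would then be passed to an MFCC extractor, and the resulting sequence of coefficient vectors to the recurrent classifier.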

Publisher

IOP Publishing

Subject

General Physics and Astronomy

Link

https://iopscience.iop.org/article/10.1088/1742-6596/1844/1/012003/pdf

References: 38 articles.

1. Context and Text;Shen;Theory Pract. Lang. Stud.,2012

2. Automatic dialect and accent recognition and its application to speech recognition;Biadsy,2011

3. A Highly Adaptive Acoustic Model for Accurate Multi-dialect Speech Recognition

4. Java and Sunda dialect recognition from Indonesian speech using GMM and I-Vector

Cited by 2 articles.

1. Dialect classification based on the speed and the pause of speech utterances;Phonetics and Speech Sciences;2023-06

2. Mel Frequency Cepstral Coefficient and its Applications: A Review;IEEE Access;2022