Spoken language identification using i-vectors, x-vectors, PLDA and logistic regression-Reference-Cited by-同舟云学术

Spoken language identification using i-vectors, x-vectors, PLDA and logistic regression

Published:2021-08-01 Issue:4 Volume:10 Page:2237-2244
ISSN:2302-9285
Container-title:Bulletin of Electrical Engineering and Informatics
language:
Short-container-title:Bulletin EEI

Author:

Abdurrahman Ahmad Iqbal,Zahra Amalia

Abstract

In this paper, i-vector and x-vector is used to extract the features from speech signal from local Indonesia languages, namely Javanese, Sundanese and Minang languages to help classifier identify the language spoken by the speaker. Probabilistic linear discriminant analysis (PLDA) are used as the baseline classifier and logistic regression technique are used because of prior studies showing logistic regression has better performance than PLDA for classifying speech data. Once these features are extracted. The feature is going to be classified using the classifier mentioned before. In the experiment, we tried to segment the test data to three segment such as 3, 10, and 30 seconds. This study is expanded by testing multiple parameters on the i-vector and x-vector method then comparing PLDA and logistic regression performance as its classifier. The x-vector has better score than i-vector for every segmented data while using PLDA as its classifier, except where the i-vector and x-vector is using logistic regression, i-vector still has better accuracy compared to x-vector.

Publisher

Institute of Advanced Engineering and Science

Subject

Electrical and Electronic Engineering,Control and Optimization,Computer Networks and Communications,Hardware and Architecture,Instrumentation,Information Systems,Control and Systems Engineering,Computer Science (miscellaneous)

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A review of social background profiling of speakers from speech accents;PeerJ Computer Science;2024-04-16

2. Datasets Collection Framework for Low-Resourced Languages in South Africa;2024 Conference on Information Communications Technology and Society (ICTAS);2024-03-07

3. Common latent representation learning for low-resourced spoken language identification;Multimedia Tools and Applications;2023-09-26

4. The Effect of Synthetic Voice Data Augmentation on Spoken Language Identification on Indian Languages;IEEE Access;2023

5. Spoken Language Recognition with Cluster-Based Modeling;ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2022-05-23