Language Recognition Using Latent Dynamic Conditional Random Field Model with Phonological Features-Reference-Cited by-同舟云学术

Language Recognition Using Latent Dynamic Conditional Random Field Model with Phonological Features

Published:2014 Issue: Volume:2014 Page:1-16
ISSN:1024-123X
Container-title:Mathematical Problems in Engineering
language:en
Short-container-title:Mathematical Problems in Engineering

Author:

Boonsuk Sirinoot¹^ORCID,Suchato Atiwong¹,Punyabukkana Proadpran¹,Wutiwiwatchai Chai²,Thatphithakkul Nattanun²

Affiliation:

1. Department of Computer Engineering, Chulalongkorn University, Bangkok 10330, Thailand

2. HLT, National Electronics and Computer Technology Center (NECTEC), Bangkok 10400, Thailand

Abstract

Spoken language recognition (SLR) has been of increasing interest in multilingual speech recognition for identifying the languages of speech utterances. Most existing SLR approaches apply statistical modeling techniques with acoustic and phonotactic features. Among the popular approaches, the acoustic approach has become of greater interest than others because it does not require any prior language-specific knowledge. Previous research on the acoustic approach has shown less interest in applying linguistic knowledge; it was only used as supplementary features, while the current state-of-the-art system assumes independency among features. This paper proposes an SLR system based on the latent-dynamic conditional random field (LDCRF) model using phonological features (PFs). We use PFs to represent acoustic characteristics and linguistic knowledge. The LDCRF model was employed to capture the dynamics of the PFs sequences for language classification. Baseline systems were conducted to evaluate the features and methods including Gaussian mixture model (GMM) based systems using PFs, GMM using cepstral features, and the CRF model using PFs. Evaluated on the NIST LRE 2007 corpus, the proposed method showed an improvement over the baseline systems. Additionally, it showed comparable result with the acoustic system based oni-vector. This research demonstrates that utilizing PFs can enhance the performance.

Funder

Thailand Graduate Institute of Science and Technology

Publisher

Hindawi Limited

Subject

General Engineering,General Mathematics

Link

http://downloads.hindawi.com/journals/mpe/2014/250160.pdf

Reference11 articles.

1. Comparison of four approaches to automatic language identification of telephone speech

2. A Vector Space Modeling Approach to Spoken Language Identification

3. Shifted-Delta MLP Features for Spoken Language Recognition

4. Support vector machines for speaker and language recognition

5. Support vector machines using GMM supervectors for speaker verification

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Gesture Recognition using Latent-Dynamic based Conditional Random Fields and Scalar Features;Journal of Physics: Conference Series;2017-02