Speaker Model Clustering to Construct Background Models for Speaker Verification-Reference-Cited by-同舟云学术

Speaker Model Clustering to Construct Background Models for Speaker Verification

Published:2017-03-01 Issue:1 Volume:42 Page:127-135
ISSN:2300-262X
Container-title:Archives of Acoustics
language:
Short-container-title:

Author:

Dişken Gökay,Tüfekci Zekeriya,Çevik Ulus

Abstract

Abstract Conventional speaker recognition systems use the Universal Background Model (UBM) as an imposter for all speakers. In this paper, speaker models are clustered to obtain better imposter model representations for speaker verification purpose. First, a UBM is trained, and speaker models are adapted from the UBM. Then, the k-means algorithm with the Euclidean distance measure is applied to the speaker models. The speakers are divided into two, three, four, and five clusters. The resulting cluster centers are used as background models of their respective speakers. Experiments showed that the proposed method consistently produced lower Equal Error Rates (EER) than the conventional UBM approach for 3, 10, and 30 seconds long test utterances, and also for channel mismatch conditions. The proposed method is also compared with the i-vector approach. The three-cluster model achieved the best performance with a 12.4% relative EER reduction in average, compared to the i-vector method. Statistical significance of the results are also given.

Publisher

Walter de Gruyter GmbH

Subject

Acoustics and Ultrasonics

Link

http://www.degruyter.com/view/j/aoa.2017.42.issue-1/aoa-2017-0014/aoa-2017-0014.pdf

Reference20 articles.

1. Reducing Speaker Model Search Space in Speaker Identification;De Leon;Biometrics Symposium USA,2007

2. Rod - man Joint frame and Gaus - sian selection for text independent speaker verification IEEE International Conference on Acoustics Speech and Signal Processing;Saeidi;USA,2010

3. Deep Neural Network Approaches to Speaker and Lan - guage Recognition Processing;Richardson;IEEE Signal Letters,2015

4. Speaker Verification With Feature - Space MAPLR Parameters;Zhu;IEEE Trans Audio Speech Processing,2011

5. An Effective Speaker Clustering Method using UBM and Ultra - Short Train - ing Utterances of;Hossa;Archives Acoustics,2016