Affiliation:
1. Chongqing University of Posts and Telecommunications
Abstract
Although a great success has been achieved under the environment of lab where the training data is sufficient and the surroundings are quiet, speaker identification (SI) in practical use still remains a challenge because of the complicated environment. To tackle this challenge, a hybrid system of Gaussian mixture model-support vector machines (GMM-SVM) is proposed in this paper. SVM can do well with less data but is computationally expensive while GMM is computationally inexpensive but needs more data to perform adequately. In this paper, SVM and GMM are parallel in both the training and testing phase, the judgment of them are fused to make the final decision: the person with the largest score is identified as the true speaker. Universal background model (UBM) is used in GMM to improve the recognition accuracy. The system is evaluated on part of the TIMIT database and a Chinese database which is recorded by our own. Experiments have shown that the method proposed in this paper is effective. The system has better performance and robustness than the baseline systems.
Publisher
Trans Tech Publications, Ltd.
Reference9 articles.
1. Zhaohui Wu, Yingchun Yang: Models and Methods for Speaker Recognition (Tsinghua University Publications, Beijing 2009).
2. R. Togneri and D. Pullella: An Overview of Speaker Identification: Accuracy and Robustness Issues. Circuits and Systems Magazine, IEEE. Vol. 11, No. 2, pp.23-61, (2011).
3. Jun Wu: A speaker recognition system based on MFCC and SCHMM. ICT and Energy Efficiency and Workshop on Information Theory and Security (CIICT 2012), pp.88-92, (2012).
4. Campbell, W., Campbell, J., Reynolds, D., Singer, E. and Torres-Carrasquillo, P.: Support vector machines for speaker and language recognition. Computer Speech and Language 20, 2-3 (April 2006), pp.210-229.
5. Li Zhao: Speech Signal Processing (China Machine Press, Beijing 2009).
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献