Affiliation:
1. National Lab of Machine Perception and Center for Information Science, Peking University, Beijing 100871, China
Abstract
In this paper, we extend the Hierarchical Mixture of Experts (HME) to temporal processing and apply it to a substantial problem, text-dependent speaker identification. For this multiway classification task, we propose a generalized Bernoulli density in place of the multinomial logit density to avoid instability during training. A time-delay technique is applied for spatio-temporal processing in the HME, and a scheme for combining multiple time-delay HMEs is presented to provide a multi-scale analysis of the temporal data. Using the time-delay HME trained with the EM algorithm, together with the combination of multiple time-delay HMEs, the speaker identification system achieves good performance with notably fast training. We also address some issues concerning time-delay techniques in the HME.
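The time-delay idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the function name `time_delay_frames`, the `delay` parameter, and the choice to stack consecutive feature frames into one input vector are assumptions made here for illustration only.

```python
import numpy as np

def time_delay_frames(features, delay):
    """Stack each frame with the `delay` frames that follow it.

    Hypothetical helper (not from the paper): turns a sequence of
    per-frame feature vectors into overlapping windows so that a
    static model sees a short temporal context at each step.
    """
    features = np.asarray(features, dtype=float)
    n_frames, n_dims = features.shape
    window = delay + 1  # number of frames per stacked input
    if n_frames < window:
        raise ValueError("sequence is shorter than the time-delay window")
    # Each row is `window` consecutive frames flattened into one vector.
    return np.stack([
        features[t:t + window].reshape(-1)
        for t in range(n_frames - window + 1)
    ])

# Toy sequence: 5 frames with 2 features each.
toy = np.arange(10).reshape(5, 2)
stacked = time_delay_frames(toy, delay=2)  # shape (3, 6)
```

Calling the helper with several different `delay` values yields inputs at several temporal scales, which is one plausible way to feed multiple time-delay models whose outputs are later combined.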
Publisher
World Scientific Pub Co Pte Lt
Subject
Computer Networks and Communications, General Medicine
Cited by
12 articles.
1. Effective Biometric Technology Used with Big Data;Proceedings of Seventh International Congress on Information and Communication Technology;2022-07-12
2. On the Use of Different Speech Representations for Speaker Modeling;IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews);2005-08
3. Boosting Input/Output Hidden Markov Models for Sequence Classification;Lecture Notes in Computer Science;2005
4. Parallel system design for time-delay neural networks;IEEE Transactions on Systems, Man and Cybernetics, Part C (Applications and Reviews);2000-05
5. A MODULAR NEURAL NETWORK ARCHITECTURE FOR PATTERN CLASSIFICATION BASED ON DIFFERENT FEATURE SETS;International Journal of Neural Systems;1999-12