Affiliation:
1. National Institute of Technology, Raipur, India
Abstract
In the current scenario, audio classification and recognition of a particular source of voice is a major challenge when several speakers are speaking at a time. The process of identifying each speaker in an audio segment is called speaker diarization. The major steps involved in speaker diarization are speech detection, speaker change, and speaker merges. Finding and suggesting the best filter is one of the most important task involved in every step of this process. No current researches at present have yet focused on the impact of filter with optimized approaches. In this chapter, a simple yet effective method using homorphism has been implemented to recommend the best filter for any audio classification task for this purpose. After having performed the classification task for a number of commonly used filters, their accuracies have been compared at every step of speaker diarization. In this work, a module using Bayesian information criterion, convolutional neural networks is implemented, and diarization algorithm that performs the task of speaker diarization is proposed.
Reference17 articles.
1. Speaker Diarization: A Review of Recent Research
2. Analysis of a random forests model;G.Biau;Journal of Machine Learning Research,2012
3. A training algorithm for optimal mar-gin classifiers;B. E.Boser;Proceedings of the fifth annual workshop on Computational learning theory,1992
4. Speaker diarization using deep neural network embedding’s;2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),2017
5. Extremely randomized trees
Cited by
7 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献