SiamTDNN: Enhancing Discriminative Embeddings for Speaker Diarization
-
Published:2023-08-19
Issue:
Volume:
Page:
-
ISSN:0218-1266
-
Container-title:Journal of Circuits, Systems and Computers
-
language:en
-
Short-container-title:J CIRCUIT SYST COMP
Author:
Zhang Runqing (1),
Lu Huijun (1),
Cai Dunbo (1),
Huang Zhiguo (1),
Du Yujian (1),
Qian Ling (1),
Zhang Yijun (2)
Affiliation:
1. Center for Technology Research & Innovation, China Mobile (Suzhou) Software Technology Co., Ltd., Suzhou, Jiangsu, China
2. PaaS, China Mobile (Suzhou) Software Technology Co., Ltd., Suzhou, Jiangsu, China
Abstract
Recent advances in speaker embeddings have driven substantial progress in speaker diarization. However, determining ‘who spoke when’ in meeting scenarios remains challenging because speakers' voices can be similar and the number of speakers is unknown. This paper proposes enhanced discriminative features for speaker diarization, comprising discriminative speaker-specific features based on Siamese networks and a speaker re-verification method. With a Siamese architecture, SiamTDNN, the work first explores latent representations that are capable of modeling intra-class and inter-class differences between speakers by training on audio pairs. A re-verification method with a local-global strategy is then introduced to identify speakers in multi-speaker conversations. The method provides a novel speaker embedding with enhanced discriminative power for disambiguating speakers and raises the upper bound on the number of speakers. The proposed speaker embedding achieves an EER of 1.1% and a minDCF of 0.1192 on VoxCeleb1 for the speaker verification task. Extensive experiments on AiShell-4, ICSI, AMI and VoxConverse demonstrate the effectiveness of the proposed method, with an average DER reduction of 3% and an RTF of 0.0792.
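For readers unfamiliar with the verification metric quoted in the abstract, the following is a minimal sketch of how an equal error rate (EER) can be estimated from scored pairs of speaker embeddings. It is not the authors' evaluation code: the function names `cosine_score` and `equal_error_rate` are hypothetical, cosine similarity is assumed as the pair-scoring function (the abstract does not state which score is used), and the EER is approximated by a simple threshold sweep rather than ROC interpolation.

```python
import math


def cosine_score(a, b):
    """Cosine similarity between two embedding vectors (assumed scoring function)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def equal_error_rate(target_scores, nontarget_scores):
    """Approximate the EER by trying every observed score as a decision threshold.

    target_scores: scores of same-speaker pairs.
    nontarget_scores: scores of different-speaker pairs.
    """
    thresholds = sorted(set(target_scores) | set(nontarget_scores))
    best_gap, eer = float("inf"), 1.0
    for t in thresholds:
        # False rejection: a same-speaker pair scored below the threshold.
        frr = sum(s < t for s in target_scores) / len(target_scores)
        # False acceptance: a different-speaker pair scored at or above it.
        far = sum(s >= t for s in nontarget_scores) / len(nontarget_scores)
        gap = abs(far - frr)
        if gap < best_gap:
            # The EER is the operating point where both error rates meet.
            best_gap, eer = gap, (far + frr) / 2
    return eer
```

In this sketch, each trial pair would be scored with `cosine_score` and the scores split into same-speaker and different-speaker lists before calling `equal_error_rate`; a lower value means the embedding separates the two kinds of pairs more cleanly.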
Funder
The National Key R&D Plan
The research project of China Mobile Communications Group Co., Ltd
Publisher
World Scientific Pub Co Pte Ltd
Subject
Electrical and Electronic Engineering, Hardware and Architecture