Self-Supervised Open-Set Speaker Recognition with Laguerre–Voronoi Descriptors
Author:
Ohi Abu Quwsar1ORCID, Gavrilova Marina L.1ORCID
Affiliation:
1. Department of Computer Science, University of Calgary, Calgary, AB T2N1N4, Canada
Abstract
Speaker recognition is a challenging problem in behavioral biometrics that has been rigorously investigated over the last decade. Although numerous supervised closed-set systems inherit the power of deep neural networks, limited studies have been made on open-set speaker recognition. This paper proposes a self-supervised open-set speaker recognition that leverages the geometric properties of speaker distribution for accurate and robust speaker verification. The proposed framework consists of a deep neural network incorporating a wider viewpoint of temporal speech features and Laguerre–Voronoi diagram-based speech feature extraction. The deep neural network is trained with a specialized clustering criterion that only requires positive pairs during training. The experiments validated that the proposed system outperformed current state-of-the-art methods in open-set speaker recognition and cluster representation.
Funder
Natural Sciences and Engineering Research Council (NSERC) Discovery Grant funding NSERC Strategic Partnership Grant University of Calgary Transdisciplinary Connector Funding
Reference38 articles.
1. Balestriero, R., Ibrahim, M., Sobal, V., Morcos, A., Shekhar, S., Goldstein, T., Bordes, F., Bardes, A., Mialon, G., and Tian, Y. (2023). A cookbook of self-supervised learning. arXiv. 2. Chen, H., Gouin-Vallerand, C., Bouchard, K., Gaboury, S., Couture, M., Bier, N., and Giroux, S. (2024). Enhancing Human Activity Recognition in Smart Homes with Self-Supervised Learning and Self-Attention. Sensors, 24. 3. Recent advances in open set recognition: A survey;Geng;IEEE Trans. Pattern Anal. Mach. Intell.,2020 4. Chung, J.S., Huh, J., Mun, S., Lee, M., Heo, H.S., Choe, S., Ham, C., Jung, S., Lee, B.J., and Han, I. (2020). In defence of metric learning for speaker recognition. arXiv. 5. Palo, H.K., and Behera, D. (2020). Critical Approaches to Information Retrieval Research, IGI Global.
|
|