A novel speaker verification approach featuring multidomain acoustics based on the weighted city block Minkowski distance-Reference-Cited by-同舟云学术

A novel speaker verification approach featuring multidomain acoustics based on the weighted city block Minkowski distance

Published:2024-08-19 Issue: Volume: Page:
ISSN:1225-6463
Container-title:ETRI Journal
language:en
Short-container-title:ETRI Journal

Author:

Jha Khushboo¹^ORCID,Srivastava Sumit¹^ORCID,Jain Aruna¹

Affiliation:

1. Department of Computer Science and Engineering Birla Institute of Technology, Mesra Ranchi India

Abstract

AbstractAccess control is vital in interconnected environments like the Internet of Things, Industry 4.0, and smart connectivity, ensuring authorized access for security. Biometric‐based access, particularly speaker verification (SV), enhances security with unique vocal features, offering nonintrusive authentication with continuous monitoring. Single‐domain features prove insufficient in distinguishing similar traits, prompting latest SV advancements to adopt multidomain‐based speech features. This paradigm addresses the limitations of single‐domain features by amalgamating the merits of individual domains, establishing a cutting‐edge approach. It utilizes cepstral–frequency–time domain feature fusion, achieved via cepstral mean‐variance normalization for generalizability. The weighted city block Minkowski distance is proposed to compare reference and test speech templates. Parameters are computed based on the confusion matrix, template matching distance functions, dynamic acoustic conditions, and additive white Gaussian noise. A deep convolutional neural network classifier is assessed on open‐source LibriSpeech and Speaker in the Wild corpora, surpassing the current methodologies.

Publisher

Wiley

Link

https://onlinelibrary.wiley.com/doi/pdf/10.4218/etrij.2023-0485

Reference39 articles.

1. Secure mmWave‐radar‐based speaker verification for IoT smart home;Dong Y.;IEEE Int. Things J.,2020

2. A Survey of Speaker Recognition: Fundamental Theories, Recognition Methods and Opportunities

3. Frequent-words analysis for forensic speaker comparison

4. Deep speaker embeddings for Speaker Verification: Review and experimental comparison

5. AWLloss: speaker verification based on the quality and difficulty of speech;Liu Q.;IEEE Signal Process. Lett.,2023