Milestones in speaker recognition-Reference-Cited by-同舟云学术

Milestones in speaker recognition

Published:2024-02-15 Issue:3 Volume:57 Page:
ISSN:1573-7462
Container-title:Artificial Intelligence Review
language:en
Short-container-title:Artif Intell Rev

Author:

Sharma R.,Govind D.,Mishra J.,Dubey A. K.,Deepak K. T.,Prasanna S. R. M.

Abstract

AbstractThis article reviews significant research in the domain of speaker recognition, i.e., the task of determining the speaker’s identity from its speech. Unlike conventional review articles, this document strives to be concise and selective, provide a historical context, and reach a wider audience. In this endeavour, a summary of selected key works of every decade is provided which highlights the theme(s) of research of that period. At first, an overview of the humble beginnings of the 1960s and 70s is provided, followed by the key developments in the 80s and 90s. The prime focus of the research community in the 2000s is then discussed, leading to various non-conventional features, modelling techniques, and hybrid or fusion systems. The developments of the last decade (the 2010s), such as the i-vector-based systems, are then discussed. Modern speaker recognition based on Artificial Intelligence (AI), such as the x-vector system, and refinements of the i-vector-based systems using deep neural networks, are then discussed. The article concludes with a concise discussion of the evolving recent trends and allied research in speaker recognition.

Funder

Ministry of Electronics and Information technology

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s10462-023-10688-w.pdf

Reference107 articles.

1. Aharon M, Elad M, Bruckstein A (2006) K-svd: an algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans Signal Process 54(11):4311–4322

2. Atal B (1972) Text-independent speaker recognition. J Acoust Soc Am 52(1A):181–181

3. Atal BS (1974) Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification. J Acoust Soc Am 55(6):1304–1312

4. Bai Z, Zhang X-L (2021) Speaker recognition based on deep learning: an overview. Neural Netw 140:65–99

5. Beek B, Neuberg E, Hodge D (1977) An assessment of the technology of automatic speech recognition for military applications. IEEE Trans Acoust, Speech, Signal Process 25(4):310–322

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Robustness study of speaker recognition based on ECAPA-TDNN-CIFG;Journal of Computational Methods in Sciences and Engineering;2024-08-14

2. Enhancing speaker identification in criminal investigations through clusterization and rank-based scoring;Forensic Science International: Digital Investigation;2024-07

3. Deep Learning for Speaker Recognition: A Comparative Analysis of 1D-CNN and LSTM Models Using Diverse Datasets;2024 4th Interdisciplinary Conference on Electrics and Computer (INTCEC);2024-06-11