An investigation into the reliability of speaker recognition schemes: analysing the impact of environmental factors utilising deep learning techniques-Reference-Cited by-同舟云学术

An investigation into the reliability of speaker recognition schemes: analysing the impact of environmental factors utilising deep learning techniques

Published:2024-01-06 Issue:1 Volume:71 Page:
ISSN:1110-1903
Container-title:Journal of Engineering and Applied Science
language:en
Short-container-title:J. Eng. Appl. Sci.

Author:

Khazaleh Omar Ratib,Khrais Leen Ahmed

Abstract

AbstractThis paper studies the performance and reliability of deep learning-based speaker recognition schemes under various recording situations and background noise presence. The study uses the Speaker Recognition Dataset offered in the Kaggle website, involving audio recordings from different speakers, and four scenarios with various combinations of speakers. In the first scenario, the scheme achieves discriminating capability and high accuracy in identifying speakers without taking into account outside noise, having roughly one area under the ROC curve. Nevertheless, in the second scenario, with background noise added to the recording, accuracy decreases, and misclassifications increase. However, the scheme still reveals good discriminating power, with ROC areas ranging from 0.77 to 1.

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1186/s44147-023-00351-0.pdf

Reference24 articles.

1. Jadhav S, Karpe S, Das S (2021) Sound classification using python. In: ITM Web of Conferences. EDP Sciences, vol. 40, p 03024

2. Mukhamadiyev A, Khujayarov I, Djuraev O, Cho J (2022) Automatic speech recognition method based on deep learning approaches for Uzbek language. Sensors 22(10):3683

3. Tuunanen T (2020) Real-time sound event detection with python. (Master’s thesis)

4. Ohi A, Mridha MF, Hamid MA, Monowar MM (2021) Deep speaker recognition: process, progress, and challenges. IEEE Access 9:89619–89643

5. Le Q, Miralles-Pechuán L, Kulkarni S, Su J (2020) An overview of deep learning in industry. Data Anal AI 1:65–98