Voice spoofing detection using a neural networks assembly considering spectrograms and mel frequency cepstral coefficients-Reference-Cited by-同舟云学术

Voice spoofing detection using a neural networks assembly considering spectrograms and mel frequency cepstral coefficients

Published:2023-12-18 Issue: Volume:9 Page:e1740
ISSN:2376-5992
Container-title:PeerJ Computer Science
language:en
Short-container-title:

Author:

Hernández-Nava Carlos Alberto¹^ORCID,Rincón-García Eric Alfredo²,Lara-Velázquez Pedro²,de-los-Cobos-Silva Sergio Gerardo²,Gutiérrez-Andrade Miguel Angel²,Mora-Gutiérrez Roman Anselmo³

Affiliation:

1. Posgrado en Ciencias y Tecnologías de la Información, Universidad Autónoma Metropolitana, Ciudad de México, Ciudad de México, México

2. Departamento de Ingeniería Eléctrica, Universidad Autónoma Metropolitana, Ciudad de México, Ciudad de México, México

3. Departamento de Sistemas, Universidad Autónoma Metropolitana de Azcapotzalco, Ciudad de México, Ciudad de México, México

Abstract

Nowadays, biometric authentication has gained relevance due to the technological advances that have allowed its inclusion in many daily-use devices. However, this same advantage has also brought dangers, as spoofing attacks are now more common. This work addresses the vulnerabilities of automatic speaker verification authentication systems, which are prone to attacks arising from new techniques for the generation of spoofed audio. In this article, we present a countermeasure for these attacks using an approach that includes easy to implement feature extractors such as spectrograms and mel frequency cepstral coefficients, as well as a modular architecture based on deep neural networks. Finally, we evaluate our proposal using the well-know ASVspoof 2017 V2 database, the experiments show that using the final architecture the best performance is obtained, achieving an equal error rate of 6.66% on the evaluation set.

Funder

Autonomous Metropolitan University

Publisher

PeerJ

Subject

General Computer Science

Link

https://peerj.com/articles/cs-1740.pdf

Reference28 articles.

1. Toward robust audio spoofing detection: a detailed comparison of traditional and learned features;Balamurali;IEEE Access,2019

2. ResNet and model fusion for automatic spoofing detection;Chen,2017

3. Instantaneous phase and excitation source features for detection of replay attacks;Das,2018

4. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences;Davis;IEEE Transactions on Acoustics,1980

5. ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements;Delgado,2018