Affiliation:
1. ADANA SCIENCE AND TECHNOLOGY UNIVERSITY
Abstract
Ensuring security in speaker recognition systems is crucial. In recent years, it has been demonstrated that spoofing attacks can fool these systems. To address this issue, spoofed speech detection systems have been developed. While these systems perform well, their effectiveness tends to degrade under noise. Traditional speech enhancement methods do not improve performance effectively and can even make it worse. This paper investigates the performance of a noise mask, obtained via a convolutional neural network, for reducing the effects of noise. The mask is used to suppress noisy regions of spectrograms in order to extract robust i-vectors. The proposed system was tested on the ASVspoof 2015 database with three different noise types and achieved superior performance compared to traditional systems. However, performance degrades for noise types that were not encountered during the training phase.
Publisher
Uludag University Journal of the Faculty of Engineering