Abstract
AbstractSpoofing attacks posed by generating artificial speech can severely degrade the performance of a speaker verification system. Recently, many anti-spoofing countermeasures have been proposed for detecting varying types of attacks from synthetic speech to replay presentations. While there are numerous effective defenses reported on standalone anti-spoofing solutions, the integration for speaker verification and spoofing detection systems has obvious benefits. In this paper, we propose a spoofing-robust automatic speaker verification system for diverse attacks based on a multi-task learning architecture. This deep learning-based model is jointly trained with time-frequency representations from utterances to provide recognition decisions for both tasks simultaneously. Compared with other state-of-the-art systems on the ASVspoof 2017 and 2019 corpora, a substantial improvement of the combined system under different spoofing conditions can be obtained.
Funder
University of Western Australia
Publisher
Springer Science and Business Media LLC
Subject
Applied Mathematics,Signal Processing
Reference46 articles.
1. F. Alegre, A. Amehraye, N. Evans, Spoofing countermeasures to protect automatic speaker verification from voice conversion, in 2013 IEEE International Conference on Acoustics (Speech and Signal Processing (ICASSP) (IEEE, 2013), pp. 3068–3072
2. F. Alegre, R. Vipperla, A. Amehraye, N. Evans, A new speaker verification spoofing countermeasure based on local binary patterns, in Interspeech (2013), pp. 940–944
3. C. Chen, A. Ross, A multi-task convolutional neural network for joint iris detection and presentation attack detection, in 2018 IEEE Winter Applications of Computer Vision Workshops (WACVW) (IEEE, 2018), pp. 44–51
4. D. Chen, B.K.W. Mak, Multitask learning of deep neural networks for low-resource speech recognition. IEEE/ACM Trans. Audio Speech Lang. Process. TASLP 23(7), 1172–1183 (2015)
5. P.L. De Leon, M. Pucher, J. Yamagishi, I. Hernaez, I. Saratxaga, Evaluation of speaker verification security and detection of HMM-based synthetic speech. IEEE/ACM Trans. Audio Speech Lang. Process. 20(8), 2280–2290 (2012)
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献