Voiceprint Identification for Limited Dataset Using the Deep Migration Hybrid Model Based on Transfer Learning-Reference-Cited by-同舟云学术

Voiceprint Identification for Limited Dataset Using the Deep Migration Hybrid Model Based on Transfer Learning

Published:2018-07-23 Issue:7 Volume:18 Page:2399
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Sun Cunwei,Yang Yuxin,Wen Chang,Xie Kai,Wen Fangqing

Abstract

The convolutional neural network (CNN) has made great strides in the area of voiceprint recognition; but it needs a huge number of data samples to train a deep neural network. In practice, it is too difficult to get a large number of training samples, and it cannot achieve a better convergence state due to the limited dataset. In order to solve this question, a new method using a deep migration hybrid model is put forward, which makes it easier to realize voiceprint recognition for small samples. Firstly, it uses Transfer Learning to transfer the trained network from the big sample voiceprint dataset to our limited voiceprint dataset for the further training. Fully-connected layers of a pre-training model are replaced by restricted Boltzmann machine layers. Secondly, the approach of Data Augmentation is adopted to increase the number of voiceprint datasets. Finally, we introduce fast batch normalization algorithms to improve the speed of the network convergence and shorten the training time. Our new voiceprint recognition approach uses the TLCNN-RBM (convolutional neural network mixed restricted Boltzmann machine based on transfer learning) model, which is the deep migration hybrid model that is used to achieve an average accuracy of over 97%, which is higher than that when using either CNN or the TL-CNN network (convolutional neural network based on transfer learning). Thus, an effective method for a small sample of voiceprint recognition has been provided.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

http://www.mdpi.com/1424-8220/18/7/2399/pdf

Reference34 articles.

1. Convolutional Neural Networks for Speech Recognition

2. Speaker recognition under limited data condition by noise addition

Cited by 35 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Text-independent voiceprint recognition via compact embedding of dilated deep convolutional neural networks;Computers and Electrical Engineering;2024-09

2. Classification of Pneumonia via a Hybrid ZFNet-Quantum Neural Network Using a Chest X-ray Dataset;Current Medical Imaging Formerly Current Medical Imaging Reviews;2024-08-22

3. SDI: A tool for speech differentiation in user identification;Expert Systems with Applications;2024-06

4. A stacked convolutional neural network framework with multi-scale attention mechanism for text-independent voiceprint recognition;Pattern Analysis and Applications;2024-04-27

5. Topology Optimization Design Method for Acoustic Imaging Array of Power Equipment;Sensors;2024-03-22