Enhanced-Deep-Residual-Shrinkage-Network-Based Voiceprint Recognition in the Electric Industry
-
Published:2023-07-10
Issue:14
Volume:12
Page:3017
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Zhang Qingrui1, Zhai Hongting1, Ma Yuanyuan2, Sun Lili1, Zhang Yantong1, Quan Weihong1, Zhai Qi1, He Bangwei2, Bai Zhiquan2
Affiliation:
1. Information and Telecommunications Branch, State Grid Shandong Electric Power Company, Jinan 250001, China 2. School of Information Science and Engineering, Shandong University, Qingdao 266237, China
Abstract
Voiceprint recognition can extract voice features and identity the speaker through the voice information, which has great application prospects in personnel identity verification and voice dispatching in the electric industry. The traditional voiceprint recognition algorithms work well in a quiet environment. However, noise interference inevitably exists in the electric industry, degrading the accuracy of traditional voiceprint recognition algorithms. In this paper, we propose an enhanced deep residual shrinkage network (EDRSN)-based voiceprint recognition by combining the traditional voiceprint recognition algorithms with deep learning (DL) in the context of the noisy electric industry environment, where a dual-path convolution recurrent network (DPCRN) is employed to reduce the noise, and its structure is also improved based on the deep residual shrinkage network (DRSN). Moreover, we further use a convolutional block attention mechanism (CBAM) module and a hybrid dilated convolution (HDC) in the proposed EDRSN. Simulation results show that the proposed network can enhance the speaker’s vocal features and further distinguish and eliminate the noise features, thus reducing the noise influence and achieving better recognition performance in a noisy electric environment.
Funder
State Grid Corporation of China
Subject
Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering
Reference29 articles.
1. Huang, Z., Zhang, X., Wang, L., and Li, Z. (2017, January 14–16). Study and implementation of voiceprint identity authentication for Android mobile terminal. Proceedings of the 2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI), Shanghai, China. 2. Jin, X., Zhang, Y., and Wang, X. (2012, January 21–24). Strategy and coordinated development of strong and smart grid. Proceedings of the IEEE PES Innovative Smart Grid Technologies, Tianjin, China. 3. Le, X., Chen, H., Chen, K., and Lu, J. (2021). DPCRN: Dual-path convolution recurrent network for single channel speech enhancement. arXiv. 4. Inference Skipping for More Efficient Real-Time Speech Enhancement With Parallel RNNs;Le;IEEE/ACM Trans. Audio Speech Lang. Process.,2022 5. Lin, N., Chen, G., Zhou, Q., and Liu, C. (2021, January 22–24). Dilated Residual Shrinkage Network for SAR Image Despeckling. Proceedings of the 2021 IEEE 6th International Conference on Signal and Image Processing (ICSIP), Nanjing, China.
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|