Generation of Voice Signal Tone Sandhi and Melody Based on Convolutional Neural Network-Reference-Cited by-同舟云学术

Generation of Voice Signal Tone Sandhi and Melody Based on Convolutional Neural Network

Published:2022-09-19 Issue: Volume: Page:
ISSN:2375-4699
Container-title:ACM Transactions on Asian and Low-Resource Language Information Processing
language:en
Short-container-title:ACM Trans. Asian Low-Resour. Lang. Inf. Process.

Author:

Jiang Wei¹^ORCID,Li Mengqi¹^ORCID,Shabaz Mohammad²^ORCID,Sharma Ashutosh³^ORCID,Haq Mohd Anul⁴^ORCID

Affiliation:

1. Department of Music, Shandong University of Science and Technology, Qingdao Shandong, 266590, China

2. Model Institute of Engineering and Technology, Jammu, J&K, India

3. School of Computer Science, University of Petroleum and Energy Studies, Dehradun, India

4. Department of Computer Science, College of Computer Science and Information Science, Majmaah University, Saudi Arabia

Abstract

There is a need to prevent the generation of criminal activities in the voice signals due to changing voices by intruders to cover up their personal identities. The voice signal change detection based on convolutional neural network is proposed in this work that uses three commonly used voice processing software to change the tone of the voice library: Audacity, CoolEdit and RTISI. The research further raises 5 semitones for each voice, which are recorded at different levels, as +4, +5, +6, +7 and +8 respectively. Simultaneously, every speech is lowered by 5 halftones and which are further represented as -4, -5, -6, -7 and -8 respectively. The convolution neural network corresponding to network b-3 is determined as the final classifier in this article through experiments. The average accuracy A1 of its three categories has reached more than 97%, the detection accuracy A2 of electronic tone sandhi speech has reached more than 97%, and the false alarm rate FAR of the original speech is less than 1.9%. The outcomes obtained shows that the detection algorithm in this paper is effective, and it has good generalization ability.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3545569

Reference37 articles.

1. Diagnosing Parkinson's disease with speech signal based on convolutional neural network

2. The development of abstract representations of tone sandhi.

3. Andersen , A. H. , Haan , J. M. D. , Tan , Z. H. , & Jensen , J. , ( 2018 ). Non-intrusive speech intelligibility prediction using convolutional neural networks . IEEE/ACM Transactions on Audio, Speech, and Language Processing, PP( 99), 1-1. Andersen, A. H., Haan, J. M. D., Tan, Z. H., & Jensen, J., (2018). Non-intrusive speech intelligibility prediction using convolutional neural networks. IEEE/ACM Transactions on Audio, Speech, and Language Processing, PP(99), 1-1.

4. Dual Discriminator GAN: Restoring Ancient Yi Characters;Chen S.;Transactions on Asian and Low-Resource Language Information Processing,2022

5. Linguistically Driven Multi-Task Pre-Training for Low-Resource Neural Machine Translation

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Tweet Spam Detection Using Machine Learning and Swarm Optimization Techniques;IEEE Transactions on Computational Social Systems;2022