Affiliation:
1. School of Foreign Languages, Nanyang Institute of Technology, Nanyang 473000, China
Abstract
Computer-assisted practice of spoken language is common, but current approaches have two problems. First, because fluency features are computed from expert knowledge, key information contained in the original data may be lost. Second, the parameters of each model are optimized separately, which leaves overall performance in a sub-optimal state. To address these problems, a spoken English fluency scoring method based on a convolutional neural network is proposed. So that feature extraction accounts for the short-, medium-, and long-term characteristics of the speech signal, three convolution layers are stacked, and the feature extraction and scoring models are learned jointly from the original time-domain signal input. In the feature extraction process, principal component analysis is applied to extract useful information from the audio features. The experimental results show that the scoring results of the proposed method are more accurate.
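The idea of stacking three convolution layers on the raw waveform can be illustrated with a small sketch. This is only a forward pass under assumed settings, not the paper's model: the kernel widths, strides, random weights, average pooling, and the names `conv1d`, `receptive_field`, `LAYERS`, and `score_waveform` are all hypothetical, and no training or principal component analysis step is shown. It is meant to show how each added layer sees a longer stretch of the input signal.

```python
import random

def conv1d(x, kernel, stride):
    """Valid 1-D convolution (cross-correlation) followed by ReLU."""
    k = len(kernel)
    out = []
    for start in range(0, len(x) - k + 1, stride):
        acc = sum(x[start + j] * kernel[j] for j in range(k))
        out.append(max(acc, 0.0))
    return out

def receptive_field(layers):
    """Number of input samples seen by one output unit of a conv stack."""
    rf, jump = 1, 1
    for k, s in layers:
        rf += (k - 1) * jump  # each layer widens the view by (k - 1) * jump
        jump *= s             # strides multiply the step between units
    return rf

# Hypothetical (kernel_width, stride) per layer; not taken from the paper.
LAYERS = [(8, 4), (8, 4), (8, 4)]

def score_waveform(wave, seed=0):
    """Run an untrained three-layer conv stack and a linear scorer."""
    rng = random.Random(seed)
    h = wave
    for k, s in LAYERS:
        # Random weights stand in for parameters that would be learned.
        kernel = [rng.uniform(-1, 1) / k for _ in range(k)]
        h = conv1d(h, kernel, s)
    pooled = sum(h) / len(h)        # average over time frames
    w, b = rng.uniform(-1, 1), 0.0  # placeholder scoring layer
    return w * pooled + b, len(h)
```

With these assumed settings, one unit in the first, second, and third layer covers 8, 36, and 148 input samples respectively, which is one way a stack of layers can capture short-, medium-, and long-term structure. In a jointly trained system, the convolution kernels and the scoring layer would be updated together from a single scoring loss rather than fixed at random values.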
Subject
Electrical and Electronic Engineering, Computer Networks and Communications, Information Systems
Cited by
1 article.