EduSpeak®: A speech recognition and pronunciation scoring toolkit for computer-aided language learning applications-Reference-Cited by-同舟云学术

EduSpeak®: A speech recognition and pronunciation scoring toolkit for computer-aided language learning applications

Published:2010-07 Issue:3 Volume:27 Page:401-418
ISSN:0265-5322
Container-title:Language Testing
language:en
Short-container-title:Language Testing

Author:

Franco Horacio¹,Bratt Harry²,Rossier Romain²,Rao Gadde Venkata²,Shriberg Elizabeth²,Abrash Victor²,Precoda Kristin²

Affiliation:

1. SRI International, USA,

2. SRI International, USA

Abstract

SRI International’s EduSpeak® system is a software development toolkit that enables developers of interactive language education software to use state-of-the-art speech recognition and pronunciation scoring technology. Automatic pronunciation scoring allows the computer to provide feedback on the overall quality of pronunciation and to point to specific production problems. We review our approach to pronunciation scoring, where our aim is to estimate the grade that a human expert would assign to the pronunciation quality of a paragraph or a phrase. Using databases of nonnative speech and corresponding human ratings at the sentence level, we evaluate different machine scores that can be used as predictor variables to estimate pronunciation quality. For more specific feedback on pronunciation, the EduSpeak toolkit supports a phone-level mispronunciation detection functionality that automatically flags specific phone segments that have been mispronounced. Phone-level information makes it possible to provide the student with feedback about specific pronunciation mistakes.Two approaches to mispronunciation detection were evaluated in a phonetically transcribed database of 130,000 phones uttered in continuous speech sentences by 206 nonnative speakers. Results show that classification error of the best system, for the phones that can be reliably transcribed, is only slightly higher than the average pairwise disagreement between the human transcribers.

Publisher

SAGE Publications

Subject

Linguistics and Language,Social Sciences (miscellaneous),Language and Linguistics

Link

http://journals.sagepub.com/doi/pdf/10.1177/0265532210364408

Reference23 articles.

Cited by 46 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A computer-assisted tool for automatically measuring non-native Japanese oral proficiency;Computer Assisted Language Learning;2024-07-17

2. Natural Language Processing Applications in Language Assessment;Fostering Foreign Language Teaching and Learning Environments With Contemporary Technologies;2024-02-09

3. Controllable Accented Text-to-Speech Synthesis With Fine and Coarse-Grained Intensity Rendering;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024

4. An Effective Hierarchical Graph Attention Network Modeling Approach for Pronunciation Assessment;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024

5. Explicit Intensity Control for Accented Text-to-speech;INTERSPEECH 2023;2023-08-20