Automatic Assessment of Speech Capability Loss in Disordered Speech

Author:

Pellegrini Thomas1,Fontan Lionel1,Mauclair Julie2,Farinas Jérôme1,Alazard-Guiu Charlotte3,Robert Marina4,Gatignol Peggy5

Affiliation:

1. Université de Toulouse; UPS; IRIT, Toulouse, France

2. Université Paris Descartes; IRIT, Paris, France

3. Université de Toulouse; Octogone-Lordat, TOULOUSE Cedex

4. Université Paris Ouest, Nanterre, France

5. Hôpital de la Pitié Salpétrière, Paris, France

Abstract

In this article, we report on the use of an automatic technique to assess pronunciation in the context of several types of speech disorders. Even if such tools already exist, they are more widely used in a different context, namely, Computer-Assisted Language Learning, in which the objective is to assess nonnative pronunciation by detecting learners’ mispronunciations at segmental and/or suprasegmental levels. In our work, we sought to determine if the Goodness of Pronunciation (GOP) algorithm, which aims to detect phone-level mispronunciations by means of automatic speech recognition, could also detect segmental deviances in disordered speech. Our main experiment is an analysis of speech from people with unilateral facial palsy. This pathology may impact the realization of certain phonemes such as bilabial plosives and sibilants. Speech read by 32 speakers at four different clinical severity grades was automatically aligned and GOP scores were computed for each phone realization. The highest scores, which indicate large dissimilarities with standard phone realizations, were obtained for the most severely impaired speakers. The corresponding speech subset was manually transcribed at phone level; 8.3% of the phones differed from standard pronunciations extracted from our lexicon. The GOP technique allowed the detection of 70.2% of mispronunciations with an equal rate of about 30% of false rejections and false acceptances. Finally, to broaden the scope of the study, we explored the correlation between GOP values and speech comprehensibility scores on a second corpus, composed of sentences recorded by six people with speech impairments due to cancer surgery or neurological disorders. Strong correlations were achieved between GOP scores and subjective comprehensibility scores (about 0.7 absolute). Results from both experiments tend to validate the use of GOP to measure speech capability loss, a dimension that could be used as a complement to physiological measures in pathologies causing speech disorders.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications,Human-Computer Interaction

Reference28 articles.

1. A.-C. Albinhac and A. Rodier. 2003. Analyse quantitative et qualitative des troubles d’articulation dans les paralysies faciales périphériques. Mémoire pour l’obtention du CCO. Paris. A.-C. Albinhac and A. Rodier. 2003. Analyse quantitative et qualitative des troubles d’articulation dans les paralysies faciales périphériques. Mémoire pour l’obtention du CCO. Paris.

2. An overview of spoken language technology for education

3. Reliability of the House and Brackmann grading system for facial palsy

4. L. Fontan P. Gaillard and V. Woisard. 2014. Comprendre et agir: Les tests pragmatiques de compréhension de la parole et EloKanz. In Travaux en phonétique clinique R. Sock B. Vaxelaire and C. Fauth (Eds.). 131--144. L. Fontan P. Gaillard and V. Woisard. 2014. Comprendre et agir: Les tests pragmatiques de compréhension de la parole et EloKanz. In Travaux en phonétique clinique R. Sock B. Vaxelaire and C. Fauth (Eds.). 131--144.

Cited by 7 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Exploring the Role of Machine Learning in Diagnosing and Treating Speech Disorders: A Systematic Literature Review;Psychology Research and Behavior Management;2024-05

2. Automatic Rating of Spontaneous Speech for Low-Resource Languages;2022 IEEE Spoken Language Technology Workshop (SLT);2023-01-09

3. Reviewing Speech Input with Audio;ACM Transactions on Accessible Computing;2020-04-23

4. Autism detection based on eye movement sequences on the web;Proceedings of the 17th International Web for All Conference;2020-04-17

5. Intelligibility of Disordered Speech: Global and Detailed Scores;Interspeech 2016;2016-09-08

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3