Hybridization of Acoustic and Visual Features of Polish Sibilants Produced by Children for Computer Speech Diagnosis

Authors:

Sage Agata¹, Miodońska Zuzanna¹, Kręcichwost Michał¹, Badura Paweł¹

Affiliation:

1. Faculty of Biomedical Engineering, Silesian University of Technology, Roosevelta 40, 41-800 Zabrze, Poland

Abstract

Speech disorders are significant barriers to the balanced development of a child. Many children in Poland are affected by lisps (sigmatism), the incorrect articulation of sibilants. Because speech therapy diagnostics is complex and multifaceted, developing computer-assisted methods is crucial. This paper assesses the usefulness of hybrid feature vectors extracted from multimodal (video and audio) data for assessing the place of articulation in the sibilants /s/ and /ʂ/. We used acoustic features and, new in this field, visual parameters describing the texture and shape of selected articulators. Statistical tests on the hybrid feature vectors indicated differences between the various sibilant realizations in the context of articulation pattern assessment. For /s/, 35 variables differentiated dental and interdental pronunciation, of which 24 were visual (textural and shape). For /ʂ/, we found 49 statistically significant variables whose distributions differed between speaker groups (alveolar, dental, and postalveolar articulation), and the dominant feature type was noise-band acoustic. Our study suggests that hybridizing the acoustic description with video processing provides richer diagnostic information.
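The feature-screening idea summarized in the abstract (build a hybrid vector per recording, then check each variable for a between-group difference) can be illustrated with a minimal sketch. The abstract does not name the statistical tests used, so the permutation test on group means below is a stand-in chosen only because it needs no external libraries. The function names, the 0.05 threshold, and the toy data are all hypothetical, and no multiple-comparison correction is applied.

```python
import random
from statistics import mean


def hybrid_vector(acoustic, visual):
    """Concatenate acoustic and visual features into one hybrid vector."""
    return list(acoustic) + list(visual)


def permutation_p_value(group_a, group_b, n_permutations=2000, seed=0):
    """Two-sided permutation test on the absolute difference of group means.

    Assumption: this test is a stand-in, not the test used in the paper.
    """
    rng = random.Random(seed)
    observed = abs(mean(group_a) - mean(group_b))
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)
        diff = abs(mean(pooled[:n_a]) - mean(pooled[n_a:]))
        if diff >= observed:
            hits += 1
    # Add-one correction keeps the estimated p-value strictly positive.
    return (hits + 1) / (n_permutations + 1)


def differentiating_features(vectors_a, vectors_b, alpha=0.05):
    """Return indices of features whose values differ between two groups."""
    selected = []
    for j in range(len(vectors_a[0])):
        col_a = [v[j] for v in vectors_a]
        col_b = [v[j] for v in vectors_b]
        if permutation_p_value(col_a, col_b) < alpha:
            selected.append(j)
    return selected


# Toy data (hypothetical): feature 0 is "acoustic", feature 1 is "visual".
# Feature 0 separates the groups; feature 1 is identical in both groups.
group_dental = [hybrid_vector([0.1 * i], [float(i)]) for i in range(8)]
group_interdental = [hybrid_vector([10.0 + 0.1 * i], [float(i)]) for i in range(8)]

print(differentiating_features(group_dental, group_interdental))
```

In this toy setup only the first feature is flagged, because the second has the same values in both groups. A real analysis would use the paper's own tests and feature definitions.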

Funder

National Science Centre, Poland

Polish Ministry of Science, Poland

Publisher

MDPI AG

