Abstract
Background
The pharyngeal fricative is a typical compensatory articulation error in cleft palate speech. It adversely affects the daily communication of people who produce it. Automatic detection of pharyngeal fricatives in cleft palate speech can give clinicians and speech-language pathologists information that aids diagnosis.
Results
This paper proposes two features for detecting pharyngeal fricative speech: CSIFs (correlation of signals in independent frequency bands) and OSPP (octave spectrum prominent peak). The CSIFs feature captures the distribution characteristics of the frequency components in pharyngeal fricative speech, which are caused by the changed place of articulation and movement of the articulators. The OSPP feature reflects the degree of concentration of the prominent peak, which is closely related to the place of articulation of the pharyngeal fricative. Both features are therefore tied to the altered production process of pharyngeal fricatives. To evaluate how well the two features detect pharyngeal fricatives, we collected a speech database covering all types of initial consonants in which pharyngeal fricatives occur. An ensemble-learning classifier is used to discriminate pharyngeal fricative speech from normal speech.
Conclusion
The detection accuracy obtained with CSIFs and OSPP features ranges from 83.5 to 84.5% and from 85 to 87%, respectively. When these two features are combined, the detection accuracy for pharyngeal fricative speech ranges from 88 to 89%, with an AUC (area under the receiver operating characteristic curve) value of 93%.
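For readers less familiar with the two metrics reported here, the following is a minimal sketch of how detection accuracy and AUC can be computed from classifier outputs. It is not the evaluation code used in the paper: the function names, the 0.5 decision threshold, and the toy labels and scores are all hypothetical. The AUC is computed through its standard pairwise interpretation, namely the probability that a randomly chosen pharyngeal fricative sample receives a higher score than a randomly chosen normal sample.

```python
def accuracy(labels, predictions):
    """Fraction of samples whose predicted label matches the true label."""
    correct = sum(1 for y, p in zip(labels, predictions) if y == p)
    return correct / len(labels)


def roc_auc(labels, scores):
    """AUC as the probability that a positive sample outscores a negative one.

    A tie between a positive and a negative score counts as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one sample of each class")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data (not from the paper):
# 1 = pharyngeal fricative speech, 0 = normal speech.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.7, 0.4, 0.6, 0.3, 0.1]

# Assumed decision threshold of 0.5 for turning scores into labels.
predictions = [1 if s >= 0.5 else 0 for s in scores]

print("accuracy:", accuracy(labels, predictions))
print("AUC:", roc_auc(labels, scores))
```

Accuracy depends on the chosen threshold, whereas AUC summarizes performance over all thresholds, which is why the two figures are reported separately.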
Funder
National Natural Science Foundation of China
Publisher
Springer Science and Business Media LLC
Subject
Radiology, Nuclear Medicine and Imaging, Biomedical Engineering, General Medicine, Biomaterials, Radiological and Ultrasound Technology
Cited by
3 articles.