Affiliation:
1. Massachusetts Institute of Technology, Cambridge
2. Massachusetts General Hospital and Harvard Medical School, Boston
Abstract
A large percentage of patients who have undergone laryngectomy to treat advanced laryngeal cancer rely on an electrolarynx (EL) to communicate verbally. Although serviceable, EL speech is plagued by shortcomings in both sound quality and intelligibility. This study sought to better quantify the relative contributions of previously identified acoustic abnormalities to the perception of degraded quality in EL speech. Ten normal listeners evaluated the sound quality of EL speech tokens that had been acoustically enhanced by (a) increased low-frequency energy, (b) EL-noise reduction, and (c) fundamental frequency variation to mimic normal pitch intonation in relation to nonenhanced EL speech, normal speech, and normal monotonous speech (fundamental frequency variation removed). In comparing all possible combinations of token pairs, listeners were asked to identify which one of each pair sounded most like normal natural speech, and then to rate on a visual analog scale how different the chosen token was from normal speech. The results indicate that although EL speech can be most improved by removing the EL noise and providing proper pitch information, the resulting quality is still well below that of normal natural speech or even that of monotonous natural speech. This suggests that, in addition to the widely acknowledged acoustic abnormalities examined in this investigation, there are other attributes that contribute significantly to the unnatural quality of EL speech. Such additional factors need to be clearly identified and remedied before EL speech can be made to more closely approximate the sound quality of normal natural speech.
Publisher
American Speech Language Hearing Association
Subject
Speech and Hearing,Linguistics and Language,Language and Linguistics
Reference39 articles.
1. An experimental transistorized artificial larynx .;Barney H. L.;Readings in speech following total laryngectomy,1959
2. Application of noise reduction techniques for alaryngeal speech enhancement.;Cole D.;Proceedings of IEEE TENCON ”97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications,1997
3. Segmental durations in connected speech signals: Preliminary results
4. Diedrich W. & Youngstrom K. (1977). Alaryngeal speech. Springfield IL: Charles C Thomas.
5. Edwards A. L. (1957). Techniques of attitude scale construction. New York: Appleton-Century-Crofts.
Cited by
46 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献