Formants are easy to measure; resonances, not so much: Lessons from Klatt (1986)-Reference-Cited by-同舟云学术

Formants are easy to measure; resonances, not so much: Lessons from Klatt (1986)

Published:2022-08 Issue:2 Volume:152 Page:933-941
ISSN:0001-4966
Container-title:The Journal of the Acoustical Society of America
language:en
Short-container-title:The Journal of the Acoustical Society of America

Author:

Whalen D. H.¹,Chen Wei-Rong¹,Shadle Christine H.¹,Fulop Sean A.²

Affiliation:

1. Haskins Laboratories, New Haven, Connecticut 06511, USA

2. Department of Linguistics, California State University Fresno, Fresno, California 93740, USA

Abstract

Formants in speech signals are easily identified, largely because formants are defined to be local maxima in the wideband sound spectrum. Sadly, this is not what is of most interest in analyzing speech; instead, resonances of the vocal tract are of interest, and they are much harder to measure. Klatt [(1986). in Proceedings of the Montreal Satellite Symposium on Speech Recognition, 12th International Congress on Acoustics, edited by P. Mermelstein (Canadian Acoustical Society, Montreal), pp. 5–7] showed that estimates of resonances are biased by harmonics while the human ear is not. Several analysis techniques placed the formant closer to a strong harmonic than to the center of the resonance. This “harmonic attraction” can persist with newer algorithms and in hand measurements, and systematic errors can persist even in large corpora. Research has shown that the reassigned spectrogram is less subject to these errors than linear predictive coding and similar measures, but it has not been satisfactorily automated, making its wider use unrealistic. Pending better techniques, the recommendations are (1) acknowledge limitations of current analyses regarding influence of F0 and limits on granularity, (2) report settings more fully, (3) justify settings chosen, and (4) examine the pattern of F0 vs F1 for possible harmonic bias.

Funder

National Institute on Deafness and Other Communication Disorders

Publisher

Acoustical Society of America (ASA)

Subject

Acoustics and Ultrasonics,Arts and Humanities (miscellaneous)

Link

https://asa.scitation.org/doi/pdf/10.1121/10.0013410

Reference58 articles.

1. Formant frequency estimation of high-pitched vowels using weighted linear prediction

2. Calculation of true glottal flow and its components

3. Speech Analysis and Synthesis by Linear Prediction of the Speech Wave