Printable 3D vocal tract shapes from MRI data and their acoustic and aerodynamic properties

Published: 2020-08-05 Issue: 1 Volume: 7
ISSN:2052-4463
Container-title:Scientific Data
language:en
Short-container-title:Sci Data

Author:

Birkholz Peter, Kürbis Steffen, Stone Simon, Häsner Patrick, Blandin Rémi, Fleischer Mario

Abstract

A detailed understanding of how the acoustic patterns of speech sounds are generated by the complex 3D shapes of the vocal tract is a major goal in speech research. The Dresden Vocal Tract Dataset (DVTD) presented here contains geometric and (aero)acoustic data of the vocal tract of 22 German speech sounds (16 vowels, 5 fricatives, 1 lateral), each from one male and one female speaker. The data include the 3D Magnetic Resonance Imaging data of the vocal tracts, the corresponding 3D-printable and finite-element models, and their simulated and measured acoustic and aerodynamic properties. The dataset was evaluated in terms of the plausibility and the similarity of the resonance frequencies determined by the acoustic simulations and measurements, and in terms of the human identification rate of the vowels and fricatives synthesized by the artificially excited 3D-printed vocal tract models. According to both the acoustic and perceptual metrics, most models are accurate representations of the intended speech sounds and can be readily used for research and education.

Publisher

Springer Science and Business Media LLC

Subject

Library and Information Sciences; Statistics, Probability and Uncertainty; Computer Science Applications; Education; Information Systems; Statistics and Probability

Link

https://www.nature.com/articles/s41597-020-00597-w.pdf

References: 57 articles.

1. Sorensen, T. et al. Database of volumetric and real-time vocal tract MRI for speech science. In Proc. of the Interspeech 2017, 645–649 (2017).

2. Niebergall, A. et al. Real-time MRI of speaking at a resolution of 33 ms: undersampled radial FLASH with nonlinear inverse reconstruction. Magn. Reson. Med. 69, 477–485 (2013).

3. Fischer, J. et al. Magnetic resonance imaging of the vocal fold oscillations with sub-millisecond temporal resolution. Magn. Reson. Med. 83, 403–411 (2020).

4. Baer, T., Gore, J. C., Gracco, L. C. & Nye, P. W. Analysis of vocal tract shape and dimensions using Magnetic Resonance Imaging: vowels. J. Acoust. Soc. Am. 90, 799–828 (1991).

5. Story, B. H., Titze, I. R. & Hoffman, E. A. Vocal tract area functions from magnetic resonance imaging. J. Acoust. Soc. Am. 100, 537–554 (1996).

Cited by 24 articles.

1. Research in methodologies for modelling the oral cavity;Biomedical Physics & Engineering Express;2024-03-18

2. Assessing accuracy of resonances obtained with reassigned spectrograms from the “ground truth” of physical vocal tract models;The Journal of the Acoustical Society of America;2024-02-01

3. Formant Frequency Tuning of Three-Dimensional MRI-Based Vocal Tracts for the Finite Element Synthesis of Vowels;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024

4. Acoustics of Breath Noises in Human Speech: Descriptive and Three-Dimensional Modeling Approaches;Journal of Speech, Language, and Hearing Research;2023-11-16

5. Bandwidths of vocal tract resonances in physical models compared to transmission-line simulations;The Journal of the Acoustical Society of America;2023-06-01