An open-source toolbox for measuring vocal tract shape from real-time magnetic resonance images

Published: 2023-07-28
ISSN: 1554-3528
Container-title: Behavior Research Methods
Language: en
Short-container-title: Behav Res

Author:

Michel Belyk, Christopher Carignan, Carolyn McGettigan

Abstract

Real-time magnetic resonance imaging (rtMRI) is a technique that provides high-contrast videographic data of human anatomy in motion. Applied to the vocal tract, it is a powerful method for capturing the dynamics of speech and other vocal behaviours by imaging structures internal to the mouth and throat. These images provide a means of studying the physiological basis for speech, singing, expressions of emotion, and swallowing that are otherwise not accessible for external observation. However, taking quantitative measurements from these images is notoriously difficult. We introduce a signal processing pipeline that produces outlines of the vocal tract from the lips to the larynx as a quantification of the dynamic morphology of the vocal tract. Our approach performs simple tissue classification, but constrained to a researcher-specified region of interest. This combination facilitates feature extraction while retaining the domain-specific expertise of a human analyst. We demonstrate that this pipeline generalises well across datasets covering behaviours such as speech, vocal size exaggeration, laughter, and whistling, as well as producing reliable outcomes across analysts, particularly among users with domain-specific expertise. With this article, we make this pipeline available for immediate use by the research community, and further suggest that it may contribute to the continued development of fully automated methods based on deep learning algorithms.

Publisher

Springer Science and Business Media LLC

Subject

General Psychology, Psychology (miscellaneous), Arts and Humanities (miscellaneous), Developmental and Educational Psychology, Experimental and Cognitive Psychology

Link

https://link.springer.com/content/pdf/10.3758/s13428-023-02171-9.pdf

Cited by 2 articles.

1. Bilinguals from Larynx to Lips: Exploring Bilingual Articulatory Strategies with Anatomic MRI Data. Language and Speech, 2024-04-28.

2. Research in methodologies for modelling the oral cavity. Biomedical Physics & Engineering Express, 2024-03-18.