Tongue Contour Tracking and Segmentation in Lingual Ultrasound for Speech Recognition: A Review-Reference-Cited by-同舟云学术

Tongue Contour Tracking and Segmentation in Lingual Ultrasound for Speech Recognition: A Review

Published:2022-11-15 Issue:11 Volume:12 Page:2811
ISSN:2075-4418
Container-title:Diagnostics
language:en
Short-container-title:Diagnostics

Author:

Al-hammuri Khalid^ORCID,Gebali Fayez,Thirumarai Chelvan Ilamparithi,Kanan Awos^ORCID

Abstract

Lingual ultrasound imaging is essential in linguistic research and speech recognition. It has been used widely in different applications as visual feedback to enhance language learning for non-native speakers, study speech-related disorders and remediation, articulation research and analysis, swallowing study, tongue 3D modelling, and silent speech interface. This article provides a comparative analysis and review based on quantitative and qualitative criteria of the two main streams of tongue contour segmentation from ultrasound images. The first stream utilizes traditional computer vision and image processing algorithms for tongue segmentation. The second stream uses machine and deep learning algorithms for tongue segmentation. The results show that tongue tracking using machine learning-based techniques is superior to traditional techniques, considering the performance and algorithm generalization ability. Meanwhile, traditional techniques are helpful for implementing interactive image segmentation to extract valuable features during training and postprocessing. We recommend using a hybrid approach to combine machine learning and traditional techniques to implement a real-time tongue segmentation tool.

Funder

National Research Council of Canada

Publisher

MDPI AG

Subject

Clinical Biochemistry

Link

https://www.mdpi.com/2075-4418/12/11/2811/pdf

Reference126 articles.

1. Review articles: Purpose, process, and structure;J. Acad. Mark. Sci.,2018

2. Automatic contour tracking in ultrasound images;Clin. Linguist. Phon.,2005

3. Tongue contour tracking in dynamic ultrasound via higher-order MRFs and efficient fusion moves;Med. Image Anal.,2012