Evaluation of Glottal Inverse Filtering Techniques on OPENGLOT Synthetic Male and Female Vowels

Author:

Freixes Marc1ORCID,Joglar-Ongay Luis1ORCID,Socoró Joan Claudi1ORCID,Alías-Pujol Francesc1ORCID

Affiliation:

1. Human-Environment Research (HER), La Salle—Universitat Ramon Llull, Sant Joan de la Salle, 42, 08022 Barcelona, Spain

Abstract

Current articulatory-based three-dimensional source–filter models, which allow the production of vowels and diphtongs, still present very limited expressiveness. Glottal inverse filtering (GIF) techniques can become instrumental to identify specific characteristics of both the glottal source signal and the vocal tract transfer function to resemble expressive speech. Several GIF methods have been proposed in the literature; however, their comparison becomes difficult due to the lack of common and exhaustive experimental settings. In this work, first, a two-phase analysis methodology for the comparison of GIF techniques based on a reference dataset is introduced. Next, state-of-the-art GIF techniques based on iterative adaptive inverse filtering (IAIF) and quasi closed phase (QCP) approaches are thoroughly evaluated on OPENGLOT, an open database specifically designed to evaluate GIF, computing well-established GIF error measures after extending male vowels with their female counterparts. The results show that GIF methods obtain better results on male vowels. The QCP-based techniques significantly outperform IAIF-based methods for almost all error metrics and scenarios and are, at the same time, more stable across sex, phonation type, F0, and vowels. The IAIF variants improve the original technique for most error metrics on male vowels, while QCP with spectral tilt compensation achieves a lower spectral tilt error for male vowels than the original QCP.

Funder

Agencia Estatal de Investigación

Departament de Recerca i Universitats

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Reference40 articles.

1. Birkholz, P., Jackel, D., and Kroger, B. (2006, January 14–19). Construction And Control Of A Three-Dimensional Vocal Tract Model. Proceedings of the 2006 IEEE ICASSP Proceedings, Toulouse, France.

2. Effects of higher order propagation modes in vocal tract like geometries;Blandin;J. Acoust. Soc. Am.,2015

3. Influence of vocal tract geometry simplifications on the numerical simulation of vowel sounds;Arnela;J. Acoust. Soc. Am.,2016

4. MRI-based vocal tract representations for the three-dimensional finite element synthesis of diphthongs;Arnela;IEEE/ACM Trans. Audio Speech Lang. Process.,2019

5. Simulation of vowel-vowel utterances using a 3D biomechanical-acoustic model;Dabbaghchian;Int. J. Numer. Methods Biomed. Eng.,2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3