Neural network methods for vowel classification in the vocalic systems with the [ATR] (Advanced Tongue Root) contrast

Author:

Makeeva N. V.1

Affiliation:

1. Institute of Linguistics RAS

Abstract

This paper discusses the results of testing a neural network that classifies the vowels of a vocalic system with the [ATR] (Advanced Tongue Root) contrast, using data from Akebu (Kwa family). The acoustic nature of the [ATR] feature remains understudied. The only reliable acoustic correlate of [ATR] is the magnitude of the first formant (F1), which can also be modulated by tongue height, resulting in significant overlap between high [-ATR] vowels and mid [+ATR] vowels. Other acoustic metrics that have been associated with [ATR], such as F1 bandwidth (B1) and the relative intensity of F1 to F2 (A1-A2), are typically inconsistent across vowel types and speakers. The values of four metrics (F1, F2, A1-A2, B1) were used to train and test the neural network. We tested four versions of the model, differing in the presence of a fifth variable encoding the speaker and in the number of hidden layers. The models that included the speaker variable achieved slightly higher accuracy, whereas the precision and recall of the three-layer model were generally higher than those of the models with two hidden layers.
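The input layout and the evaluation metrics named in the abstract can be illustrated with a minimal Python sketch. It is not the author's model: the function names, the integer speaker code, and the toy labels are assumptions made for illustration only, and the label "I" stands in for a high [-ATR] vowel that is confused with the mid [+ATR] vowel "e".

```python
from collections import Counter


def build_features(f1, f2, a1_a2, b1, speaker_id=None):
    """Return the input vector: the four acoustic metrics, optionally
    followed by a fifth variable encoding the speaker.
    The integer speaker code is an assumption, not taken from the paper."""
    features = [float(f1), float(f2), float(a1_a2), float(b1)]
    if speaker_id is not None:
        features.append(float(speaker_id))
    return features


def accuracy(true_labels, predicted_labels):
    """Share of tokens whose predicted vowel matches the true vowel."""
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)


def precision_recall(true_labels, predicted_labels):
    """Per-vowel (precision, recall) pairs computed from paired label lists."""
    true_count = Counter(true_labels)
    pred_count = Counter(predicted_labels)
    hits = Counter(t for t, p in zip(true_labels, predicted_labels) if t == p)
    scores = {}
    for label in sorted(set(true_labels) | set(predicted_labels)):
        precision = hits[label] / pred_count[label] if pred_count[label] else 0.0
        recall = hits[label] / true_count[label] if true_count[label] else 0.0
        scores[label] = (precision, recall)
    return scores


if __name__ == "__main__":
    # Invented toy labels, not data from the paper.
    true = ["i", "I", "e", "I", "e", "i"]
    pred = ["i", "e", "e", "I", "I", "i"]
    print(build_features(300, 2200, 10, 80, speaker_id=2))
    print(accuracy(true, pred))
    print(precision_recall(true, pred))
```

Accuracy summarizes all vowels in one number, while per-vowel precision and recall show which categories are being confused. That is why two models can have similar accuracy and still differ on precision and recall.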

Publisher

Pyatigorsk State University

Subject

General Medicine

