Multi-class identification of tonal contrasts in Chokri using supervised machine learning algorithms

Author:

Gope AmaleshORCID,Pal AnusuyaORCID,Tetseo Sekholu,Gogoi Tulika,J Joanna,Borah Dinkur

Abstract

AbstractThis study examines and explores the effectiveness of various Machine Learning Algorithms (MLAs) in identifying intricate tonal contrasts in Chokri (ISO 639-3), an under-documented and endangered Tibeto-Burman language of the Sino-Tibetan language family spoken in Nagaland, India. Seven different supervised MLAs, viz., [Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), Naive Bayes (NB)], and one neural network (NN)-based algorithms [Artificial Neural Network (ANN)] are implemented to explore five-way tonal contrasts in Chokri. Acoustic correlates of tonal contrasts, encompassing fundamental frequency fluctuations, viz., f0 height and f0 direction, are examined. Contrary to the prevailing notion of NN supremacy, this study underscores the impressive accuracy achieved by the RF. Additionally, it reveals that combining f0 height and directionality enhances tonal contrast recognition for female speakers, while f0 directionality alone suffices for male speakers. The findings demonstrate MLAs’ potential to attain accuracy rates of 84–87% for females and 95–97% for males, showcasing their applicability in deciphering the intricate tonal systems of Chokri. The proposed methodology can be extended to predict multi-class problems in diverse fields such as image processing, speech classification, medical diagnosis, computer vision, and social network analysis.

Publisher

Springer Science and Business Media LLC

Reference30 articles.

1. Adank P, Smits R, Van Hout R (2004) A comparison of vowel normalization procedures for language variation research. J Acoust Soc Am 116:3099–3107

2. Boehmke B, Greenwell BM (2019) Hands-on machine learning with R. CRC Press. Boca Raton, Florida, USA

3. Boersma P, Weenink D (2012) Praat: Doing phonetics by computer (version 5.3. 82)[computer software]. Institute of Phonetic Sciences, Amsterdam

4. Brownlee J (2016a) Deep learning with Python: develop deep learning models on Theano and TensorFlow using Keras. Machine Learning Mastery. Australia

5. Brownlee J (2016b) Machine learning mastery with Python: understand your data, create accurate models, and work projects end-to-end. Machine Learning Mastery. Australia

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3