A new intonation quality evaluation method based on self-supervised learning

Authors:

Wang Wei (1, 2), Zhang Ning (2), Peng Weishi (3), Liu Zhengqi (4)

Affiliation:

1. School of Humanities and Social Science, Xi’an Jiaotong University, Xi’an, China

2. International Collaborative Innovation Center of Music Intelligence, Xi’an Conservatory of Music, Xi’an, China

3. School of Equipment Management and Support, People Armed Police Engineering University, Xi’an, Shaanxi, China

4. School of Information Sciences and Technology, Northwest University, Xi’an, Shaanxi, China

Abstract

Intonation evaluation is an important precondition for guiding music practice. This paper presents a new intonation quality evaluation method based on self-supervised learning to resolve the fuzzy evaluation problem at critical intonations. First, effective audio features are extracted automatically by a deep neural network based on self-supervised learning. Second, single tones and pitch intervals are evaluated by combining these features with key local features of the audio. Finally, the intonation evaluation method is characterized by physical calculations, which simulate and enhance manual assessment. Experimental results show that the proposed method achieves an accuracy of 93.38%, averaged over multiple experiments with randomly assigned audio data, which is much higher than that of the frequency-based intonation evaluation method (37.5%). In addition, the method has been applied in music teaching for the first time and delivers visual evaluation results.
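
For orientation, the kind of frequency-based check that the abstract uses as its baseline can be sketched as follows. This is a minimal illustration and not the authors' implementation: it assumes twelve-tone equal temperament, a reference pitch of A4 = 440 Hz, and a hypothetical 10-cent tolerance, none of which are stated in the abstract. The function names are also illustrative.

```python
import math

A4_HZ = 440.0  # assumed reference pitch; the abstract does not specify one


def cents_from_nearest_semitone(freq_hz, ref_hz=A4_HZ):
    """Return (nearest MIDI note number, deviation from it in cents)."""
    if freq_hz <= 0:
        raise ValueError("frequency must be positive")
    # Fractional MIDI note number under 12-tone equal temperament.
    midi = 69 + 12 * math.log2(freq_hz / ref_hz)
    nearest = round(midi)
    return nearest, 100 * (midi - nearest)


def interval_cents(f_low_hz, f_high_hz):
    """Size of the pitch interval between two frequencies, in cents."""
    return 1200 * math.log2(f_high_hz / f_low_hz)


def in_tune(freq_hz, tolerance_cents=10.0):
    """Hard-threshold judgement; the tolerance is a hypothetical value."""
    _, deviation = cents_from_nearest_semitone(freq_hz)
    return abs(deviation) <= tolerance_cents
```

A hard threshold of this kind gives a binary verdict even for tones whose deviation sits close to the tolerance, which is one plausible reading of the "fuzzy evaluation problem at critical intonations" that the proposed method targets.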

Publisher

IOS Press

Subject

Artificial Intelligence, General Engineering, Statistics and Probability
