Affiliation:
1. Center for Music Technology, Georgia Institute of Technology, 840 McMillan St, Atlanta, GA 30332, USA
Abstract
The automatic assessment of (student) music performance involves the characterization of the audio recordings and the modeling of human judgments. To build a computational model that provides a reliable assessment, the system must take into account various aspects of a performance including technical correctness and aesthetic standards. While some progress has been made in recent years, the search for an effective feature representation remains open-ended. In this study, we explore the possibility of using learned features from sparse coding. Specifically, we investigate three sets of features, namely a baseline set, a set of designed features, and a feature set learned with sparse coding. In addition, we compare the impact of two different input representations on the effectiveness of the learned features. The evaluation is performed on a dataset of annotated recordings of students playing snare exercises. The results imply the general viability of feature learning in the context of automatic assessment of music performances.
Publisher
World Scientific Pub Co Pte Lt
Subject
Artificial Intelligence,Computer Networks and Communications,Computer Science Applications,Linguistics and Language,Information Systems,Software
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Input Representation;An Introduction to Audio Content Analysis;2022-10-25
2. Audioinhaltsanalyse;Handbuch der Audiotechnik;2022
3. An Interdisciplinary Review of Music Performance Analysis;Transactions of the International Society for Music Information Retrieval;2020
4. Wavelet-packets Associated with Support Vector Machine Are Effective for Monophone Sorting in Music Signals;International Journal of Semantic Computing;2019-09