Predicting Key Recognition Difficulty in Music Using Statistical Learning Techniques-Reference-Cited by-同舟云学术

Predicting Key Recognition Difficulty in Music Using Statistical Learning Techniques

Published:2014-04 Issue:2 Volume:5 Page:54-69
ISSN:1947-8534
Container-title:International Journal of Multimedia Data Engineering and Management
language:en
Short-container-title:

Author:

Chuan Ching-Hua¹,Charapko Aleksey¹

Affiliation:

1. School of Computing, University of North Florida, Jacksonville, FL, USA

Abstract

In this paper, the authors use statistical models to predict the difficulty of recognizing musical keys from polyphonic audio signals. The key recognition difficulty provides important background information when comparing the performance of audio key finding algorithms that often evaluated using different private data sets. Given an audio recording, represented as extracted acoustic features, the authors applied multiple linear regression and proportional odds model to predict the difficulty level of the recording, annotated by three musicians as an integer on a 5-point Likert scale. The authors evaluated the predictions by using root mean square error, Pearson correlation coefficient, exact accuracy, and adjacent accuracy. The authors also discussed issues such as differences found between the musicians' annotations and the consistency of those annotations. To identify potential causes to the perceived difficulty for the individual musicians, the authors applied decision tree-based filtering with bagging. By using weighted naïve Bayes, the authors examined the effectiveness of each identified feature via a classification task.

Publisher

IGI Global

Reference24 articles.

1. Chai, W., & Vercoe, B. (2005). Detection of key change in classical piano music. In Proceedings of the 6th International Conference on Music Information Retrieval.

2. Chew, E. (2000). Towards a mathematical model of tonality. Doctoral dissertation, Department of Operations Research, Massachusetts Institute of Technology.

3. Chuan, C.-H., & Chew, E. (2005). Fuzzy analysis in pitch class determination for polyphonic audio key finding. In Proceedings of the 6th International Conference on Music Information Retrieval (pp. 296-303).

4. Chuan, C.-H., & Chew, E. (2012). Creating ground truth for audio key finding: When the title key may not be the key. In Proceedings of the 13th International Conference on Music Information Retrieval (pp. 247–252).

5. Gómez, E. (2006a). Tonal description of music audio signals. Doctoral dissertation, Universitat Pompeu Fabra, Barcelona, Spain.