Abstract
This article studies a composer style classification task based on raw sheet music images. While previous work on composer recognition has relied exclusively on supervised learning, we explore the use of self-supervised pretraining methods that have recently been developed for natural language processing. We first convert sheet music images to sequences of musical words, train a language model on a large set of unlabeled musical “sentences”, initialize a classifier with the pretrained language model weights, and then finetune the classifier on a small set of labeled data. We conduct extensive experiments on International Music Score Library Project (IMSLP) piano data using a range of modern language model architectures. We show that pretraining substantially improves classification performance and that Transformer-based architectures perform best. We also introduce two data augmentation strategies and present evidence that the model learns generalizable and semantically meaningful information.
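The pretrain-then-finetune pipeline described in the abstract can be illustrated with a minimal PyTorch sketch: a small causal Transformer language model is first trained on unlabeled musical-word sequences, and a classifier initialized from its weights is then finetuned on labeled composer data. This is only an illustrative sketch, not the authors' implementation; the vocabulary size, sequence length, number of composer classes, model dimensions, and the random stand-in tensors are all assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

VOCAB_SIZE = 1000    # assumed size of the musical-word vocabulary
SEQ_LEN = 64         # assumed length of a musical "sentence"
D_MODEL = 128        # assumed model dimension
N_COMPOSERS = 9      # assumed number of composer classes


class MusicLM(nn.Module):
    """Causal Transformer language model over musical-word sequences."""

    def __init__(self, nhead=4, nlayers=2):
        super().__init__()
        self.embed = nn.Embedding(VOCAB_SIZE, D_MODEL)
        self.pos = nn.Parameter(torch.zeros(1, SEQ_LEN, D_MODEL))
        layer = nn.TransformerEncoderLayer(D_MODEL, nhead, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, nlayers)
        self.lm_head = nn.Linear(D_MODEL, VOCAB_SIZE)

    def features(self, x):
        L = x.size(1)
        # Causal mask: each position attends only to earlier positions.
        mask = torch.triu(torch.full((L, L), float("-inf")), diagonal=1)
        h = self.embed(x) + self.pos[:, :L]
        return self.encoder(h, mask=mask)

    def forward(self, x):
        return self.lm_head(self.features(x))  # next-word logits


class ComposerClassifier(nn.Module):
    """Classifier initialized with the pretrained language model weights."""

    def __init__(self, lm):
        super().__init__()
        self.lm = lm  # reuse the pretrained LM as the feature extractor
        self.head = nn.Linear(D_MODEL, N_COMPOSERS)

    def forward(self, x):
        return self.head(self.lm.features(x).mean(dim=1))  # mean-pooled


# Step 1: pretrain the LM on unlabeled "sentences" via next-word prediction.
lm = MusicLM()
opt = torch.optim.Adam(lm.parameters(), lr=1e-3)
tokens = torch.randint(0, VOCAB_SIZE, (8, SEQ_LEN))  # stand-in for real data
opt.zero_grad()
logits = lm(tokens[:, :-1])
lm_loss = F.cross_entropy(logits.reshape(-1, VOCAB_SIZE),
                          tokens[:, 1:].reshape(-1))
lm_loss.backward()
opt.step()

# Step 2: finetune a classifier built on the pretrained LM with labeled data.
clf = ComposerClassifier(lm)
labels = torch.randint(0, N_COMPOSERS, (8,))
clf_loss = F.cross_entropy(clf(tokens), labels)
```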
Funder
Brian Butler HMC Faculty Enhancement Fund
Subject
Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science
Cited by
6 articles.