Affiliation:
1. School of Electrical and Electronic Engineering, Wuhan Polytechnic University, Wuhan 430023, China
Abstract
Symbolic semantic understanding of staff images is an important technological support to achieve “intelligent score flipping”. Due to the complex composition of staff symbols and the strong semantic correlation between symbol spaces, it is difficult to understand the pitch and duration of each note when the staff is performed. In this paper, we design a semantic understanding system for optical staff symbols. The system uses the YOLOv5 to implement the optical staff’s low-level semantic understanding stage, which understands the pitch and duration in natural scales and other symbols that affect the pitch and duration. The proposed note encoding reconstruction algorithm is used to implement the high-level semantic understanding stage. Such an algorithm understands the logical, spatial, and temporal relationships between natural scales and other symbols based on music theory and outputs digital codes for the pitch and duration of the main notes during performances. The model is trained with a self-constructed SUSN dataset. Experimental results with YOLOv5 show that the precision is 0.989 and that the recall is 0.972. The system’s error rate is 0.031, and the omission rate is 0.021. The paper concludes by analyzing the causes of semantic understanding errors and offers recommendations for further research. The results of this paper provide a method for multimodal music artificial intelligence applications such as notation recognition through listening, intelligent score flipping, and automatic performance.
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference34 articles.
1. Music Deep Learning: Deep Learning Methods for Music Signal Processing—A Review of the State-of-the-Art;Moysis;IEEE Access,2023
2. Tardon, L.J., Barbancho, I., Barbancho, A.M., Peinado, A., Serafin, S., and Avanzini, F. (2019). 16th Sound and Music Computing Conference SMC 2019 (28–31 May 2019, Malaga, Spain). Appl. Sci., 9.
3. Music information retrieval;Downie;Annu. Rev. Inf. Sci. Technol.,2003
4. Content-Based Music Information Retrieval: Current Directions and Future Challenges;Casey;Proc. IEEE,2008
5. Understanding Optical Music Recognition;Pacha;ACM Comput. Surv.,2020