Abstract
Music is capable of conveying many emotions. The level and type of emotion of the music perceived by a listener, however, is highly subjective. In this study, we present the Music Emotion Recognition with Profile information dataset (MERP). This database was collected through Amazon Mechanical Turk (MTurk) and features dynamical valence and arousal ratings of 54 selected full-length songs. The dataset contains music features, as well as user profile information of the annotators. The songs were selected from the Free Music Archive using an innovative method (a Triple Neural Network with the OpenSmile toolkit) to identify 50 songs with the most distinctive emotions. Specifically, the songs were chosen to fully cover the four quadrants of the valence-arousal space. Four additional songs were selected from the DEAM dataset to act as a benchmark in this study and filter out low quality ratings. A total of 452 participants participated in annotating the dataset, with 277 participants remaining after thoroughly cleaning the dataset. Their demographic information, listening preferences, and musical background were recorded. We offer an extensive analysis of the resulting dataset, together with a baseline emotion prediction model based on a fully connected model and an LSTM model, for our newly proposed MERP dataset.
Funder
Ministry of Education
RIE2020 Advanced Manufacturing and Engineering (AME) Programmatic Fund
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference87 articles.
1. The emergence of deep learning: New opportunities for music and audio technologies;Herremans;Neural Comput. Appl.,2020
2. Yang, Y.H., Su, Y.F., Lin, Y.C., and Chen, H.H. (2007, January 28). Music emotion recognition: The role of individuality. Proceedings of the International Workshop on Human-Centered Multimedia, Augsburg, Bavaria, Germany.
3. Aljanaki, A., Yang, Y.H., and Soleymani, M. (2017). Developing a benchmark for emotional analysis of music. PLoS ONE, 12.
4. Schmidt, E.M., and Kim, Y.E. (2011, January 24–28). Modeling Musical Emotion Dynamics with Conditional Random Fields. Proceedings of the ISMIR, Miami, FL, USA.
5. Chua, P., Makris, D., Herremans, D., Roig, G., and Agres, K. (2022). Predicting emotion from music videos: Exploring the relative contribution of visual and auditory information to affective responses. arXiv.
Cited by
11 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献