A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition

Author:

Louro Pedro Lima1ORCID,Redinho Hugo1ORCID,Malheiro Ricardo12ORCID,Paiva Rui Pedro1ORCID,Panda Renato13ORCID

Affiliation:

1. CISUC, LASI, DEI, FCTUC, University of Coimbra, 3030-790 Coimbra, Portugal

2. School of Technology and Management, Polytechnic Institute of Leiria, 2411-901 Leiria, Portugal

3. Ci2—Smart Cities Research Center, Polytechnic Institute of Tomar, 2300-313 Tomar, Portugal

Abstract

Classical machine learning techniques have dominated Music Emotion Recognition. However, improvements have slowed down due to the complex and time-consuming task of handcrafting new emotionally relevant audio features. Deep learning methods have recently gained popularity in the field because of their ability to automatically learn relevant features from spectral representations of songs, eliminating such necessity. Nonetheless, there are limitations, such as the need for large amounts of quality labeled data, a common problem in MER research. To understand the effectiveness of these techniques, a comparison study using various classical machine learning and deep learning methods was conducted. The results showed that using an ensemble of a Dense Neural Network and a Convolutional Neural Network architecture resulted in a state-of-the-art 80.20% F1 score, an improvement of around 5% considering the best baseline results, concluding that future research should take advantage of both paradigms, that is, combining handcrafted features with feature learning.

Funder

FCT—Foundation for Science and Technology

European Social Fund

Ci2

Publisher

MDPI AG

Reference45 articles.

1. Katayose, H., Imai, M., and Inokuchi, S. (November, January 14). Sentiment extraction in music. Proceedings of the 9th International Conference on Pattern Recognition, Rome, Italy.

2. Feng, Y., Zhuang, Y., and Pan, Y. (August, January 28). Popular Music Retrieval by Detecting Mood. Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Toronto, ON, Canada.

3. Automatic mood detection and tracking of music audio signals;Lu;IEEE Transactions on Audio, Speech, and Language Processing,2006

4. Meyers, O. (2007). A Mood-Based Music Classification and Exploration System, Massachusetts Institute of Technology, School of Architecture and Planning.

5. Novel Audio Features for Music Emotion Recognition;Panda;IEEE Trans. Affect. Comput.,2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3