BLNN:a muscular and tall architecture for emotion prediction in music-Reference-Cited by-同舟云学术

BLNN:a muscular and tall architecture for emotion prediction in music

Published:2024-07-18 Issue: Volume: Page:
ISSN:1432-7643
Container-title:Soft Computing
language:en
Short-container-title:Soft Comput

Author:

Du Xiaofeng

Abstract

AbstractIn order to perform emotion prediction in music quickly and accurately, we have proposed a muscular and tall neural network architecture for music emotion classification. Specifically, during the audio pre-processing stage, we converge mel-scale frequency cepstral coefficients features and residual phase features with weighting, enabling the extraction of more comprehensive music emotion characteristics. Additionally, to enhance the accuracy of predicting musical emotion while reducing computational complexity during training phase, we consolidate Long short term memory network with Broad learning system network. We employ long short term memory structure as the feature mapping node of broad learning system structure, leveraging the advantages of both network models. This novel Neural Network architecture, called BLNN (Broad-Long Neural Network), achieves higher prediction accuracy. i.e., 66.78%, than single network models and other benchmark with/without consolidation methods. Moreover, it achieves lower time complexity than other excellent models, i.e., 169.32 s of training time and 507.69 ms of inference time, and achieves the optimal balance between efficiency and performance. In short, the extensive experimental results demonstrate that the proposed BLNN architecture effectively predicts music emotion, surpassing other models in terms of accuracy while reducing computational demands. In addition, the detailed description of the related work, along with an analysis of its advantages and disadvantages, and its future prospects, can serve as a valuable reference for future researchers.

Funder

key subject of Shandong Province's art and science

Shandong Province Arts Science Key Project

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s00500-024-09922-6.pdf

Reference39 articles.

1. Agrawal Y, Shanker RGR, Alluri V (2021) Transformer-based approach towards music emotion recognition from lyrics. In: European conference on information retrieval. Springer International Publishing, Cham, pp 167–175

2. AlShaikhi A, Nuha HH, Lawal A et al (2023) Vertical wind profile estimation using hybrid convolutional neural networks and bidirectional long short-term memory. Arabian J Sci Eng M48(5):6915–6924. https://doi.org/10.1007/s13369-023-07665-4

3. Bolboacă R, Haller P (2023) Performance analysis of long short-term memory predictive neural networks on time series data. Mathematics 11:1432

4. Chen C, Li Q (2020) A multimodal music emotion classification method based on multifeature combined network classifier. Math Probl Eng. https://doi.org/10.1155/2020/4606027

5. Chen H, Zhang B (2021) Adaptive algorithm for feature selection of speech emotion recognition based on genetic algorithm and SVM. J Phys Conf Ser 1:012019. https://doi.org/10.1088/1742-6596/1883/1/012019