EEG-driven automatic generation of emotive music based on transformer

Author:

Jiang Hui, Chen Yu, Wu Di, Yan Jinlin

Abstract

Utilizing deep features from electroencephalography (EEG) data for emotional music composition offers a novel approach to creating personalized and emotionally rich music. Unlike textual data, continuous EEG and music signals are difficult to convert into discrete units, chiefly because no clear, fixed vocabulary exists for standardizing EEG and audio data. Without such a standard, the mapping between EEG signals and musical elements (such as rhythm, melody, and emotion) remains blurry and complex. We therefore propose a method that uses clustering to create discrete representations and a Transformer model to learn the mapping between them. Specifically, the model segments signals by their cluster labels and encodes EEG and emotional music data independently to construct a vocabulary, thereby achieving a discrete representation. A time-series dictionary built with clustering algorithms captures and exploits the temporal and structural relationships between EEG and audio data more effectively. To address the insensitivity of heterogeneous data to temporal information, we adopt a multi-head attention mechanism and positional encoding, enabling the model to attend to information in different subspaces and better understand the complex internal structure of EEG and audio data. In addition, to resolve the mismatch between local and global information in emotion-driven music generation, we introduce an audio masking prediction loss. Our method achieves 68.19% on the Hits@20 metric, a 4.9% improvement over other methods, demonstrating its effectiveness.
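The paper's implementation is not reproduced here, but the clustering-based discretization the abstract describes can be illustrated with a minimal sketch: windowed feature vectors from EEG (or audio) are quantized against a k-means codebook, and the centroid indices serve as the discrete "vocabulary" tokens. All function names, the feature dimensionality, and the vocabulary size below are illustrative assumptions, not the authors' settings.

```python
# Illustrative sketch of clustering-based discretization: continuous EEG/audio
# feature windows are assigned to k-means centroids, and the cluster indices
# act as discrete token ids forming a learned vocabulary.
import numpy as np
from sklearn.cluster import KMeans

def build_codebook(feature_windows: np.ndarray, vocab_size: int = 512) -> KMeans:
    """Fit a k-means codebook over an array of shape (n_windows, n_features)."""
    codebook = KMeans(n_clusters=vocab_size, n_init=10, random_state=0)
    codebook.fit(feature_windows)
    return codebook

def tokenize(feature_windows: np.ndarray, codebook: KMeans) -> np.ndarray:
    """Map each continuous window to its nearest centroid index (a token id)."""
    return codebook.predict(feature_windows)

if __name__ == "__main__":
    # Synthetic stand-in: 10,000 windows of 128-dimensional features.
    rng = np.random.default_rng(0)
    windows = rng.standard_normal((10_000, 128)).astype(np.float32)
    cb = build_codebook(windows, vocab_size=512)
    tokens = tokenize(windows, cb)  # shape (10_000,), values in [0, 512)
    print(tokens[:20])
```

Fitting separate codebooks for the EEG stream and the music stream, as the abstract suggests, yields two independent vocabularies that a sequence model can then translate between.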
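Once both modalities are tokenized, the abstract's multi-head attention and positional encoding components correspond to a standard encoder-decoder Transformer. The sketch below, written in PyTorch, shows one plausible arrangement: EEG tokens feed the encoder, music tokens the decoder, with sinusoidal positional encoding on both sides. Layer counts, model width, and head count are assumptions for illustration only.

```python
# Sketch of an EEG-to-music sequence model: discrete EEG tokens are encoded and
# music tokens decoded with multi-head attention plus sinusoidal positional
# encoding. Hyperparameters are illustrative, not the authors' configuration.
import math
import torch
import torch.nn as nn

class PositionalEncoding(nn.Module):
    def __init__(self, d_model: int, max_len: int = 4096):
        super().__init__()
        pos = torch.arange(max_len, dtype=torch.float).unsqueeze(1)
        div = torch.exp(torch.arange(0, d_model, 2, dtype=torch.float)
                        * (-math.log(10000.0) / d_model))
        pe = torch.zeros(max_len, d_model)
        pe[:, 0::2] = torch.sin(pos * div)
        pe[:, 1::2] = torch.cos(pos * div)
        self.register_buffer("pe", pe)

    def forward(self, x: torch.Tensor) -> torch.Tensor:  # x: (batch, seq, d_model)
        return x + self.pe[: x.size(1)]

class EEG2MusicTransformer(nn.Module):
    def __init__(self, eeg_vocab=512, music_vocab=512, d_model=256, nhead=8):
        super().__init__()
        self.src_emb = nn.Embedding(eeg_vocab, d_model)
        self.tgt_emb = nn.Embedding(music_vocab, d_model)
        self.pos = PositionalEncoding(d_model)
        self.core = nn.Transformer(d_model=d_model, nhead=nhead,
                                    num_encoder_layers=4, num_decoder_layers=4,
                                    batch_first=True)
        self.head = nn.Linear(d_model, music_vocab)

    def forward(self, eeg_tokens, music_tokens):
        src = self.pos(self.src_emb(eeg_tokens))
        tgt = self.pos(self.tgt_emb(music_tokens))
        # Causal mask so each music position attends only to earlier positions.
        mask = nn.Transformer.generate_square_subsequent_mask(
            tgt.size(1)).to(tgt.device)
        out = self.core(src, tgt, tgt_mask=mask)
        return self.head(out)  # (batch, tgt_len, music_vocab) logits
```

The multiple attention heads give the model the per-subspace focus the abstract mentions, while the positional encoding restores the temporal ordering that discretization alone would discard.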
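Finally, the audio masking prediction loss can be hedged as follows: a random fraction of the target music tokens is replaced with a mask id, and the loss is computed only at the masked positions, forcing local predictions to be consistent with the unmasked global context. The mask ratio, mask id, and helper name below are hypothetical; the paper may define this objective differently.

```python
# Sketch of a masked-prediction objective in the spirit of the abstract's
# "audio masking prediction loss": corrupt a random subset of music tokens,
# then train the model to recover them from the surrounding context.
import torch
import torch.nn.functional as F

def masked_prediction_loss(model, eeg_tokens, music_tokens,
                           mask_id: int, mask_ratio: float = 0.15):
    # Boolean mask selecting ~mask_ratio of the target positions.
    mask = torch.rand_like(music_tokens, dtype=torch.float) < mask_ratio
    corrupted = music_tokens.masked_fill(mask, mask_id)
    logits = model(eeg_tokens, corrupted)  # (batch, seq, vocab)
    # Cross-entropy only on masked positions ties local predictions to
    # the global (unmasked) musical context.
    return F.cross_entropy(logits[mask], music_tokens[mask])
```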

Publisher

Frontiers Media SA
