Transformer-Based Seq2Seq Model for Chord Progression Generation-Reference-Cited by-同舟云学术

Transformer-Based Seq2Seq Model for Chord Progression Generation

Published:2023-02-23 Issue:5 Volume:11 Page:1111
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Li Shuyu¹,Sung Yunsick²^ORCID

Affiliation:

1. Department of Multimedia Engineering, Graduate School, Dongguk University–Seoul, Seoul 04620, Republic of Korea

2. Department of Multimedia Engineering, Dongguk University–Seoul, Seoul 04620, Republic of Korea

Abstract

Machine learning is widely used in various practical applications with deep learning models demonstrating advantages in handling huge data. Treating music as a special language and using deep learning models to accomplish melody recognition, music generation, and music analysis has proven feasible. In certain music-related deep learning research, recurrent neural networks have been replaced with transformers. This has achieved significant results. In traditional approaches with recurrent neural networks, input sequences are limited in length. This paper proposes a method to generate chord progressions for melodies using a transformer-based sequence-to-sequence model, which is divided into a pre-trained encoder and decoder. A pre-trained encoder extracts contextual information from melodies, whereas a decoder uses this information to produce chords asynchronously and finally outputs chord progressions. The proposed method addresses length limitation issues while considering the harmony between chord progressions and melodies. Chord progressions can be generated for melodies in practical music composition applications. Evaluation experiments are conducted using the proposed method and three baseline models. The baseline models included the bidirectional long short-term memory (BLSTM), bidirectional encoder representation from transformers (BERT), and generative pre-trained transformer (GPT2). The proposed method outperformed the baseline models in Hits@k (k = 1) by 25.89, 1.54, and 2.13 %, respectively.

Funder

Ministry of Education of the Republic of Korea and the National Research Foundation of Korea

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/11/5/1111/pdf

Reference26 articles.

1. Ciaburro, G., Iannace, G., and Puyana-Romero, V. (2021, January 17–19). Sentiment Analysis-Based Method to Prevent Cyber Bullying. Proceedings of the 2021 International Conference on Wireless Communications, Networking and Applications, Berlin, Germany.

2. Basaran, D., Essid, S., and Peeters, G. (2018, January 23–27). Main Melody Extraction with Source-Filter NMF and CRNN. Proceedings of the International Society for Music Information Retreival, Paris, France.

3. Li, S., Jang, S., and Sung, Y. (2019). Melody Extraction and Encoding Method for Generating Healthcare Music Automatically. Electronics, 8.

4. Li, S., Jang, S., and Sung, Y. (2019). Automatic Melody Composition Using Enhanced GAN. Mathematics, 7.

5. A Hierarchical Recurrent Neural Network for Symbolic Melody Generation;Wu;IEEE Trans. Cybern.,2019

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Drawlody: Sketch-Based Melody Creation With Enhanced Usability and Interpretability;IEEE Transactions on Multimedia;2024

2. The Chordinator: Modeling Music Harmony by Implementing Transformer Networks and Token Strategies;Lecture Notes in Computer Science;2024

3. Next-Gen Language Mastery: Exploring Advances in Natural Language Processing Post-transformers;Lecture Notes in Networks and Systems;2024

4. Musicological Interpretability in Generative Transformers;2023 4th International Symposium on the Internet of Sounds;2023-10-26

5. Chord-based music generation using long short-term memory neural networks in the context of artificial intelligence;The Journal of Supercomputing;2023-10-16