Affiliation:
1. Department of Multimedia Engineering, Graduate School, Dongguk University–Seoul, Seoul 04620, Republic of Korea
2. Department of Multimedia Engineering, Dongguk University–Seoul, Seoul 04620, Republic of Korea
Abstract
Machine learning is widely used in various practical applications with deep learning models demonstrating advantages in handling huge data. Treating music as a special language and using deep learning models to accomplish melody recognition, music generation, and music analysis has proven feasible. In certain music-related deep learning research, recurrent neural networks have been replaced with transformers. This has achieved significant results. In traditional approaches with recurrent neural networks, input sequences are limited in length. This paper proposes a method to generate chord progressions for melodies using a transformer-based sequence-to-sequence model, which is divided into a pre-trained encoder and decoder. A pre-trained encoder extracts contextual information from melodies, whereas a decoder uses this information to produce chords asynchronously and finally outputs chord progressions. The proposed method addresses length limitation issues while considering the harmony between chord progressions and melodies. Chord progressions can be generated for melodies in practical music composition applications. Evaluation experiments are conducted using the proposed method and three baseline models. The baseline models included the bidirectional long short-term memory (BLSTM), bidirectional encoder representation from transformers (BERT), and generative pre-trained transformer (GPT2). The proposed method outperformed the baseline models in Hits@k (k = 1) by 25.89, 1.54, and 2.13 %, respectively.
Funder
Ministry of Education of the Republic of Korea and the National Research Foundation of Korea
Subject
General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)
Reference26 articles.
1. Ciaburro, G., Iannace, G., and Puyana-Romero, V. (2021, January 17–19). Sentiment Analysis-Based Method to Prevent Cyber Bullying. Proceedings of the 2021 International Conference on Wireless Communications, Networking and Applications, Berlin, Germany.
2. Basaran, D., Essid, S., and Peeters, G. (2018, January 23–27). Main Melody Extraction with Source-Filter NMF and CRNN. Proceedings of the International Society for Music Information Retreival, Paris, France.
3. Li, S., Jang, S., and Sung, Y. (2019). Melody Extraction and Encoding Method for Generating Healthcare Music Automatically. Electronics, 8.
4. Li, S., Jang, S., and Sung, Y. (2019). Automatic Melody Composition Using Enhanced GAN. Mathematics, 7.
5. A Hierarchical Recurrent Neural Network for Symbolic Melody Generation;Wu;IEEE Trans. Cybern.,2019
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献