Funder
National Natural Science Foundation of China
Reference57 articles.
1. Youtube-8m: A large-scale video classification benchmark;Abu-El-Haija,2016
2. HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms
3. Musiclm: Generating music from text;Agostinelli,2023
4. Self-Supervised MultiModal Versatile Networks;Alayrac
5. madmom
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Popular Hooks: A Multimodal Dataset of Musical Hooks for Music Understanding and Generation;2024 IEEE International Conference on Multimedia and Expo Workshops (ICMEW);2024-07-15
2. The NES Video-Music Database: A Dataset of Symbolic Video Game Music Paired with Gameplay Videos;Proceedings of the 19th International Conference on the Foundations of Digital Games;2024-05-21
3. GPT-4 Driven Cinematic Music Generation Through Text Processing;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14