1. VideoGPT: Video generation using VQ-VAE and transformers;yan;arXiv 2104 10157,2021
2. Music SketchNet: Controllable music generation via factorized representations of pitch and rhythm;chen;arXiv 2008 01291,2020
3. Generative pretraining from pixels;chen;Proc 37th Int Conf Mach Learn,2020
4. A hierarchical latent vector model for learning long-term structure in music;roberts;arXiv 1803 05428,2018