1. Imagen video: High definition video generation with diffusion models;ho;ArXiv Preprint,2022
2. CMCGAN: A Uniform Framework for Cross-Modal Visual-Audio Mutual Generation
3. Video diffusion models;ho;NeurIPS,0
4. Denoising diffusion probabilistic models;ho;NeurIPS,0
5. Discrete contrastive diffusion for cross-modal and conditional generation;zhu;ArXiv Preprint,2022