1. AudioLDM: Text-to-Audio generation with latent diffusion models;Liu
2. Make-an-audio: Text-to-audio generation with prompt-enhanced diffusion models;Huang
3. WavJourney: Compositional audio creation with large language models;Liu,2023
4. Text-to-Audio Generation using Instruction Guided Latent Diffusion Model
5. AudioCaps: Generating captions for audios in the wild;Kim