1. Auto-encoding variational bayes;kingma;arXiv preprint arXiv 1312 6114,2013
2. Stable diffusion with diffusers;patil;Hugging Face Blog,2022
3. U-net: Convolutional networks for biomedical image segmentation;ronneberger;Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015 18th International Conference Munich Germany October 5-9 2015 Proceedings Part III 18,2015
4. An image is worth 16x16 words: Transformers for image recognition at scale;dosovitskiy;arXiv preprint arXiv 2010 11419,2020
5. Blip: Bootstrapping language-image pre-training for unified vision-language understanding and generation;li;International Conference on Machine Learning,2022