1. Efficient learning of sparse representations with an energy-based model;ranzato;Advances in neural information processing systems,2007
2. Zero-shot text-to-image generation;ramesh;ArXiv Preprint,2021
3. Swin transformer: Hierarchical vision transformer using shifted win-dows;liu;ArXiv Preprint,2021
4. Swin transformer v2: Scaling up capacity and resolution;liu;ArXiv Preprint,2021