1. Brock, A., Donahue, J., Simonyan, K.: Large scale GAN training for high fidelity natural image synthesis (2018). https://doi.org/10.48550/ARXIV.1809.11096, https://arxiv.org/abs/1809.11096
2. Child, R.: Very deep VAEs generalize autoregressive models and can outperform them on images (2021)
3. Dai, B., Lin, D.: Contrastive learning for image captioning. arXiv preprint arXiv:1710.02534 (2017)
4. DeVries, T., Romero, A., Pineda, L., Taylor, G.W., Drozdzal, M.: On the evaluation of conditional GANs. arXiv preprint arXiv:1907.08175 (2019)
5. Donahue, J., Krähenbühl, P., Darrell, T.: Adversarial feature learning. arXiv preprint arXiv:1605.09782 (2016)