1. Alayrac, J.-B., Donahue, J., Luc, P., Miech, A., Barr, I., Hasson, Y., et al. (2022). Flamingo: A visual language model for few-shot learning. Advances in Neural Information Processing Systems, 35, 23716–23736.
2. Altieri, L., Cocchi, D., & Roli, G. (2018). SpatEntropy: Spatial Entropy Measures in R. arXiv:1804.05521.
3. Asano, Y. M., Rupprecht, C., & Vedaldi, A. (2020). A critical analysis of self-supervision, or what we can learn from a single image. ICLR. OpenReview.net.
4. Bachmann, R., Mizrahi, D., Atanov, A., & Zamir, A. (2022). MultiMAE: Multimodal multi-task masked autoencoders. ECCV (37) (Vol. 13697, pp. 348–367). Springer.
5. Bai, Y., Mei, J., Yuille, A. L., & Xie, C. (2021). Are transformers more robust than CNNs? NeurIPS (pp. 26831–26843).