1. Z. Wu, Y. Xiong, S.X. Yu, D. Lin, Unsupervised feature learning via non-parametric instance discrimination, in: CVPR, 2018, pp. 3733–3742.
2. Z. Xie, Z. Zhang, Y. Cao, Y. Lin, J. Bao, Z. Yao, Q. Dai, H. Hu, Simmim: A simple framework for masked image modeling, in: CVPR, 2022, pp. 9653–9663.
3. A simple framework for contrastive learning of visual representations;Chen,2020
4. Improving language understanding by generative pre-training;Radford,2018
5. J. Devlin, M.-W. Chang, K. Lee, K. Toutanova, BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, in: ACL, 2019, pp. 1877–1901.