1. Zero-Shot Object Detection
2. Language models are few-shot learners;Brown;Advances in neural information processing systems,2020
3. Improved baselines with momentum contrastive learning;Chen,2020
4. ImageNet: A large-scale hierarchical image database
5. Bert: Pre-training of deep bidirectional transformers for language understanding;Devlin,2018