RegionCLIP: Region-based Language-Image Pretraining

Published: 2022-06
Container-title: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Author:

Zhong Yiwu¹, Yang Jianwei², Zhang Pengchuan², Li Chunyuan², Codella Noel³, Li Liunian Harold⁴, Zhou Luowei³, Dai Xiyang³, Yuan Lu³, Li Yin¹, Gao Jianfeng²

Affiliation:

1. University of Wisconsin-Madison

2. Microsoft Research

3. Microsoft Cloud + AI

4. UCLA

Publisher

IEEE

Link

http://xplorestaging.ieee.org/ielx7/9878378/9878366/09878561.pdf?arnumber=9878561

References: 66 articles.

1. Zero-Shot Object Detection: Joint Recognition and Localization of Novel Concepts

2. Improved Visual-Semantic Alignment for Zero-Shot Object Detection

3. ViL-BERT: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks; Lu; Advances in Neural Information Processing Systems (NeurIPS), 2019

4. Microsoft COCO: Common objects in context; Lin; ECCV, 2014

5. Oscar: Object-semantics aligned pre-training for vision-language tasks; Li; ECCV, 2020

Cited by 138 articles.

1. ESC-ZSAR: Expanded Semantics from Categories with Cross-Attention for Zero-Shot Action Recognition; Expert Systems with Applications; 2024-12

2. Text-guided Graph Temporal Modeling for few-shot video classification; Engineering Applications of Artificial Intelligence; 2024-11

3. Prompt-guided DETR with RoI-pruned masked attention for open-vocabulary object detection; Pattern Recognition; 2024-11

4. Ensembling disentangled domain-specific prompts for domain generalization; Knowledge-Based Systems; 2024-10

5. A robust defect detection method with a generalization enhancer and cross-modality aggregator for cylinder bores; Engineering Applications of Artificial Intelligence; 2024-10