Language-aware multiple datasets detection pretraining for DETRs
Published: 2024-11
Issue:
Volume: 179
Page: 106506
ISSN: 0893-6080
Container-title: Neural Networks
Short-container-title: Neural Networks
Language: en