Author:
Zhang Yi,Zhang Ce,Hu Xueting,He Zhihai
Publisher
Springer Nature Singapore
Reference45 articles.
1. Lecture Notes in Computer Science;L Bossard,2014
2. Cimpoi, M., Maji, S., Kokkinos, I., Mohamed, S., Vedaldi, A.: Describing textures in the wild. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3606–3613 (2014)
3. Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: a large-scale hierarchical image database. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 248–255 (2009)
4. Dosovitskiy, A., et al.: An image is worth 16$$\times $$16 words: transformers for image recognition at scale. In: International Conference on Learning Representations (2020)
5. Du, Y., Wei, F., Zhang, Z., Shi, M., Gao, Y., Li, G.: Learning to prompt for open-vocabulary object detection with vision-language model. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 14084–14093 (2022)
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Learning to Adapt CLIP for Few-Shot Monocular Depth Estimation;2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV);2024-01-03