Author:
Gao Peng,Geng Shijie,Zhang Renrui,Ma Teli,Fang Rongyao,Zhang Yongfeng,Li Hongsheng,Qiao Yu
Publisher
Springer Science and Business Media LLC
Subject
Artificial Intelligence,Computer Vision and Pattern Recognition,Software
Reference70 articles.
1. Alayrac, J. B., Donahue, J., Luc, P., et al. (2022). Flamingo: a visual language model for few-shot learning. In A. H. Oh, A. Agarwal, D. Belgrave, et al. (Eds.), Advances in neural information processing systems. MIT Press.
2. Anderson, P., He, X., & Buehler, C., et al. (2018). Bottom-up and top-down attention for image captioning and visual question answering. In CVPR.
3. Bossard, L., Guillaumin, M., & Van Gool, L. (2014). Food-101–mining discriminative components with random forests. In European conference on computer vision, Springer, pp. 446–461.
4. Brown, T., Mann, B., & Ryder, N., et al. (2020). Language models are few-shot learners. In NeurIPS.
5. Carion, N., Massa, F., & Synnaeve, G., et al. (2020). End-to-end object detection with transformers. In ECCV.
Cited by
103 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献