Author:
Wang Lei, He Jiabang, Li Shenshen, Liu Ning, Lim Ee-Peng
Publisher:
Springer Nature Switzerland
References: 36 articles.
1. Agrawal, H., et al.: nocaps: novel object captioning at scale. In: 2019 IEEE/CVF International Conference on Computer Vision, ICCV, pp. 8947–8956. IEEE (2019)
2. Bang, Y., et al.: A multitask, multilingual, multimodal evaluation of ChatGPT on reasoning, hallucination, and interactivity. CoRR abs/2302.04023 (2023). https://doi.org/10.48550/arXiv.2302.04023
3. Biten, A.F., Gómez, L., Karatzas, D.: Let there be a clock on the beach: reducing object hallucination in image captioning. In: IEEE/CVF Winter Conference on Applications of Computer Vision, WACV (2022)
4. Brown, T.B., et al.: Language models are few-shot learners. In: NeurIPS (2020)
5. Chung, H.W., et al.: Scaling instruction-finetuned language models. arXiv preprint arXiv:2210.11416 (2022)