Author:
Tzelepi Maria,Mezaris Vasileios
Publisher
Springer Nature Switzerland
Reference28 articles.
1. Achiam, J., et al.: GPT-4 technical report. arXiv preprint arXiv:2303.08774 (2023)
2. Ao, T., Zhang, Z., Liu, L.: GestureDiffuCLIP: gesture diffusion model with CLIP latents. arXiv preprint arXiv:2303.14613 (2023)
3. Aubakirova, D., Gerdes, K., Liu, L.: PatFig: generating short and long captions for patent figures. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2843–2849 (2023)
4. Brown, T., et al.: Language models are few-shot learners. In: Advances in Neural Information Processing Systems, vol. 33, pp. 1877–1901 (2020)
5. Fu, D., et al.: Drive like a human: rethinking autonomous driving with large language models. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 910–919 (2024)