1. Achiam, J., et al.: GPT-4 technical report. arXiv preprint arXiv:2303.08774 (2023)
2. Anil, R., et al.: PaLM 2 technical report. arXiv preprint arXiv:2305.10403 (2023)
3. Brown, T., et al.: Language models are few-shot learners. In: Advances in Neural Information Processing Systems, vol. 33, pp. 1877–1901 (2020)
4. Caesar, H., et al.: nuScenes: a multimodal dataset for autonomous driving. In: CVPR (2020)
5. Cao, X., et al.: MAPLM: a real-world large-scale vision-language dataset for map and traffic scene understanding (2023). https://github.com/LLVM-AD/MAPLM