Publisher
Springer Nature Singapore