Funder
Natural Science Foundation of Jilin Province
Publisher
Springer Science and Business Media LLC
Reference55 articles.
1. Stefanini M, Cornia M, Baraldi L et al (2023) From show to tell: a survey on deep learning-based image captioning. IEEE Transactions on Pattern Analysis and Machine Intelligence 45(1):539–559. https://doi.org/10.1109/TPAMI.2022.3148210
2. Jia J, Ding X, Pang S et al (2023) Image captioning based on scene graphs: a survey. Expert Syst Appl pp 120698
3. Zohourianshahzadi Z, Kalita JK (2022) Neural attention for image captioning: review of outstanding methods. Artif Intell Rev 55(5):3833–3862
4. Hossain MZ, Sohel F, Shiratuddin MF et al (2019) A comprehensive survey of deep learning for image captioning. ACM Computing Surveys (CsUR) 51(6):1–36
5. Xu K, Ba J, Kiros R et al (2015) Show, attend and tell: Neural image caption generation with visual attention. In: International conference on machine learning, PMLR, pp 2048–2057