Deep-learning-based image captioning:analysis and prospects-Reference-Cited by-同舟云学术

Deep-learning-based image captioning:analysis and prospects

Published:2023 Issue:9 Volume:28 Page:2788-2816
ISSN:1006-8961
Container-title:Journal of Image and Graphics
language:en
Short-container-title:

Author:

Yongqiang Zhao, ,Zhi Jin,Feng Zhang,Haiyan Zhao,Zhengwei Tao,Chengfeng Dou,Xinhai Xu,Donghong Liu

Publisher

Aerospace Information Research Institute, Chinese Academy of Sciences

Subject

Artificial Intelligence,Computer Graphics and Computer-Aided Design,Computer Vision and Pattern Recognition,Human-Computer Interaction

Reference105 articles.

1. Anderson P,Fernando B,Johnson M and Gould S. 2016. SPICE:semantic propositional image caption evaluation//Proceedings of the 14th European Conference on Computer Vision. Amsterdam,the Netherlands:Springer:382-398[DOI:10.1007/978-3-319-46454-1_24]

2. Anderson P,He X D,Buehler C,Teney D,Johnson M,Gould S and Zhang L. 2018. Bottom-up and top-down attention for image captioning and visual question answering//Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City,USA:IEEE:6077-6086[DOI:10.1109/CVPR. 2018. 00636]

3. Aslam A. 2022. Detecting objects in less response time for processing multimedia events in smart cities//Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. New Orleans,USA:IEEE:2043-2053[DOI:10.1109/CVPRW56347.2022.00222]

4. Banerjee S and Lavie A. 2005. METEOR:an automatic metric for MT evaluation with improved correlation with human judgments//Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization. ANN Arbor,Michigan:ACL:65-73

5. Bengio S,Vinyals O,Jaitly N and Shazeer N. 2015. Scheduled sampling for sequence prediction with recurrent neural networks//Proceedings of the 28th International Conference on Neural Information Processing Systems. Montreal,Canada:MIT Press:1171-1179