Author:
Yongqiang Zhao,Zhi Jin,Feng Zhang,Haiyan Zhao,Zhengwei Tao,Chengfeng Dou,Xinhai Xu,Donghong Liu
Publisher
Aerospace Information Research Institute, Chinese Academy of Sciences
Subject
Artificial Intelligence,Computer Graphics and Computer-Aided Design,Computer Vision and Pattern Recognition,Human-Computer Interaction
Reference: 105 articles.
1. Anderson P,Fernando B,Johnson M and Gould S. 2016. SPICE:semantic propositional image caption evaluation//Proceedings of the 14th European Conference on Computer Vision. Amsterdam,the Netherlands:Springer:382-398[DOI:10.1007/978-3-319-46454-1_24]
2. Anderson P,He X D,Buehler C,Teney D,Johnson M,Gould S and Zhang L. 2018. Bottom-up and top-down attention for image captioning and visual question answering//Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City,USA:IEEE:6077-6086[DOI:10.1109/CVPR.2018.00636]
3. Aslam A. 2022. Detecting objects in less response time for processing multimedia events in smart cities//Proceedings of 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. New Orleans,USA:IEEE:2043-2053[DOI:10.1109/CVPRW56347.2022.00222]
4. Banerjee S and Lavie A. 2005. METEOR:an automatic metric for MT evaluation with improved correlation with human judgments//Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization. Ann Arbor,Michigan:ACL:65-73
5. Bengio S,Vinyals O,Jaitly N and Shazeer N. 2015. Scheduled sampling for sequence prediction with recurrent neural networks//Proceedings of the 28th International Conference on Neural Information Processing Systems. Montreal,Canada:MIT Press:1171-1179