Author:
Tang Pengjie,Rao Hong,Zhang Ai,Tan Yunlan
Funder
National Natural Science Foundation of China
Scientific Research Foundation of Education Bureau of Jiangxi Province
Jiangxi Provincial Natural Science Foundation
Publisher
Springer Science and Business Media LLC
Reference57 articles.
1. Banerjee S, Lavie A (2005) Meteor: an automatic metric for mt evaluation with improved correlation with human judgments. In: Annual Meeting of the Association for Computational Linguistics Workshop, pp 65–72
2. Chang X, Yu Y, Yang Y et al (2017) Semantic pooling for complex event analysis in untrimmed videos. IEEE Trans Pattern Anal Mach Intell 39(8):1617–1632
3. Chang X, Ren P, Xu P et al (2023) A comprehensive survey of scene graphs: generation and application. IEEE Trans Pattern Anal Mach Intell 45(1):1–26
4. Chen S, Jiang Y (2019) Motion guided spatial attention for video captioning. In: AAAI Conference on artificial intelligence, pp 8191–8198
5. Chen T, Zhang Z, You Q, et al (2018) “factual” or “emotional”: Stylized image captioning with adaptive learning and attention. In: European Conference on computer vision, pp 527–543