Funder
Consejo Nacional de Ciencia y Tecnología
Subject
Computer Vision and Pattern Recognition, Signal Processing, Software
Cited by
7 articles.