Funder
National Natural Science Foundation of China
Natural Science Foundation of Hebei Province
Publisher
Springer Science and Business Media LLC
Subject
Computer Networks and Communications,Hardware and Architecture,Media Technology,Software
Reference53 articles.
1. Anderson P (2018) Bottom-up and top-down attention for image captioning and visual question answering. In: Proc IEEE/CVF Conf Comput Vis Pattern Recognit(CVPR), Boston, USA, pp 6077–6086
2. Aswani AV, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser L, Polosukhin I (2017) Attention is all you need. In: Proc Adv Neural Inf Process Syst(NIPS), USA, pp 5998–6008
3. Barlas G, Veinidis C, Arampatzis A (2021) What we see in a photograph: content selection for image captioning. Vis Comput 37:1309–1326. https://doi.org/10.1007/s00371-020-01867-9
4. Cao D, Zhu M, Gao L (2019) An image caption method based on object detection. Multimed Tools Appl 78:35329–35350. https://doi.org/10.1007/s11042-019-08116-9
5. Chang YS (2018) Fine-grained attention for image caption generation. Multimed Tools Appl 77:2959–2971. https://doi.org/10.1007/s11042-017-4593-1