A survey on deep neural network-based image captioning

Published: 2018-06-09 Issue: 3 Volume: 35 Page: 445-470
ISSN: 0178-2789
Container-title: The Visual Computer
Language: en
Short-container-title: Vis Comput

Author:

Liu Xiaoxiao, Xu Qingyang, Wang Ning

Funder

National Natural Science Foundation of China

Natural Science Foundation of Shandong Province

Publisher

Springer Science and Business Media LLC

Subject

Computer Graphics and Computer-Aided Design, Computer Vision and Pattern Recognition, Software

Link

http://link.springer.com/content/pdf/10.1007/s00371-018-1566-y.pdf

References: 101 articles.

1. Yan, R., Hauptmann, A.G.: A review of text and image retrieval approaches for broadcast news video. Inf. Retr. 10, 445–484 (2007)

2. Bernardi, R., Cakici, R., Elliott, D., Erdem, A., Erdem, E., Ikizler-Cinbis, N., Keller, F., Muscat, A., Plank, B.: Automatic description generation from images: a survey of models, datasets, and evaluation measures. J. Artif. Intell. Res. 55, 409–442 (2016)

3. Wiriyathammabhum, P., Summers-Stay, D., Fermüller, C., Aloimonos, Y.: Computer vision and natural language processing: recent approaches in multimedia and robotics. ACM Comput. Surv. 49, 71 (2016)

4. Kuznetsova, P., Ordonez, V., Berg, A.C., Berg, T.L., Choi, Y.: Collective generation of natural image descriptions. In: Meeting of the Association for Computational Linguistics: Long Papers, Jeju Island, Korea, pp. 359–368 (2012)

5. Frome, A., Corrado, G.S., Shlens, J., Bengio, S., Dean, J., Ranzato, M., Mikolov, T.: DeViSE: a deep visual-semantic embedding model. In: International Conference on Neural Information Processing Systems, Neural Information Processing Systems Foundation, Lake Tahoe, pp. 2121–2129 (2013)

Cited by 62 articles.

1. From vision to text: A comprehensive review of natural image captioning in medical diagnosis and radiology report generation;Medical Image Analysis;2024-10

2. Novel concept-based image captioning models using LSTM and multi-encoder transformer architecture;Scientific Reports;2024-09-05

3. Fruit fast tracking and recognition of apple picking robot based on improved YOLOv5;IET Image Processing;2024-06-20

4. Advanced Generative Deep Learning Techniques for Accurate Captioning of Images;Wireless Personal Communications;2024-04-29

5. Domain-specific image captioning: a comprehensive review;International Journal of Multimedia Information Retrieval;2024-04-18