1. Kiros R, Salakhutdinov R, Zemel R (2014) Multi-modal neural language models, In: Proceedings of the 31st international conference on machine learning, pp 595–603
2. Xu K, Ba JL, Kiros R, Cho K, Courville AC, Salakhutdinov R, Zemel R, Bengio Y (2015) Show, attend and tell: Neural image caption generation with visual attention, In: Proceedings of the 32nd international conference on machine learning, pp 2048–2057
3. Anderson P, He X, Buehler C, Teney D, Johnson M, Gould S, Zhang L (2018) Bottom-up and top-down attention for image captioning and visual question answering, In: 2018 IEEE conference on computer vision and pattern recognition, pp 6077–6086
4. Mathews AP, Xie L, He X (2016) SentiCap: Generating image descriptions with sentiments, In: Proceedings of the 30th AAAI conference on artificial intelligence, pp 3574–3580
5. Devlin J, Chang M-W, Lee K, Toutanova K (2019) BERT: Pre-training of deep bidirectional transformers for language understanding, In: Proceedings of the 2019 conference of the North American chapter of the association for computational linguistics, pp 4171–4186