Fast Parameter Adaptation for Few-shot Image Captioning and Visual Question Answering
Author:
Affiliation:
1. Southern University of Science and Technology & University of Technology Sydney, Sydney, Australia
2. China Electronics Technology Group Corporation, Beijing, China
3. Zhejiang University, Zhejiang, China
Publisher
ACM
Link
https://dl.acm.org/doi/pdf/10.1145/3240508.3240527
Reference52 articles.
1. Lisa Anne Hendricks Subhashini Venugopalan Marcus Rohrbach Raymond Mooney Kate Saenko Trevor Darrell Junhua Mao Jonathan Huang Alexander Toshev Oana Camburu etal 2016. Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data. In CVPR . Lisa Anne Hendricks Subhashini Venugopalan Marcus Rohrbach Raymond Mooney Kate Saenko Trevor Darrell Junhua Mao Jonathan Huang Alexander Toshev Oana Camburu et al. 2016. Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data. In CVPR .
2. VQA: Visual Question Answering
3. Shaojie Bai J Zico Kolter and Vladlen Koltun. 2018. An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling. arXiv preprint arXiv:1803.01271 (2018). Shaojie Bai J Zico Kolter and Vladlen Koltun. 2018. An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling. arXiv preprint arXiv:1803.01271 (2018).
4. Kyunghyun Cho Bart van Merrienboer Caglar Gulcehre Dzmitry Bahdanau Fethi Bougares Holger Schwenk and Yoshua Bengio. 2014. Learning Phrase Representations using RNN Encoder--Decoder for Statistical Machine Translation. In EMNLP . Kyunghyun Cho Bart van Merrienboer Caglar Gulcehre Dzmitry Bahdanau Fethi Bougares Holger Schwenk and Yoshua Bengio. 2014. Learning Phrase Representations using RNN Encoder--Decoder for Statistical Machine Translation. In EMNLP .
5. Xuanyi Dong Junshi Huang Yi Yang and Shuicheng Yan. 2017a. More Is Less: A More Complicated Network With Less Inference Complexity. In CVPR . Xuanyi Dong Junshi Huang Yi Yang and Shuicheng Yan. 2017a. More Is Less: A More Complicated Network With Less Inference Complexity. In CVPR .
Cited by 35 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Boosting Semi-Supervised Video Captioning via Learning Candidates Adjusters;ACM Transactions on Multimedia Computing, Communications, and Applications;2024-07-11
2. Introduction to Bioinformatics and Machine Learning;Advances in Bioinformatics and Biomedical Engineering;2024-04-05
3. Task-related network based on meta-learning for few-shot knowledge graph completion;Applied Intelligence;2024-04
4. Transductive Cross-Lingual Scene-Text Visual Question Answering;Neural Information Processing;2023-11-14
5. A survey on machine learning from few samples;Pattern Recognition;2023-07
1.学者识别学者识别
2.学术分析学术分析
3.人才评估人才评估
"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370
www.globalauthorid.com
TOP
Copyright © 2019-2024 北京同舟云网络信息技术有限公司 京公网安备11010802033243号 京ICP备18003416号-3