Author:
Yang Xuewen, Zhang Heming, Jin Di, Liu Yingru, Wu Chi-Hao, Tan Jianchao, Xie Dongliang, Wang Jue, Wang Xin
Publisher
Springer International Publishing
Cited by
26 articles.
1. A transformer-based Urdu image caption generation;Journal of Ambient Intelligence and Humanized Computing;2024-07-02
2. PFIDT: Pyramid Focal Inverse Distance Transformer for Crowd Localization;2024 5th International Conference on Information Science, Parallel and Distributed Systems (ISPDS);2024-05-31
3. FashionVLM - Fashion Captioning Using Pretrained Vision Transformer and Large Language Model;2024 International Conference on Emerging Smart Computing and Informatics (ESCI);2024-03-05
4. NLP-Based Fusion Approach to Robust Image Captioning;IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing;2024
5. Improving fashion captioning via attribute-based alignment and multi-level language model;Applied Intelligence;2023-11-25