Author:
Zhong Xian,Nie Guozhang,Huang Wenxin,Liu Wenxuan,Ma Bo,Lin Chia-Wen
Subject
Electrical and Electronic Engineering,Computer Vision and Pattern Recognition,Media Technology,Signal Processing
Reference46 articles.
1. Deep visual-semantic alignments for generating image descriptions;Karpathy;IEEE Trans. Pattern Anal. Mach. Intell.,2017
2. T. Yao, Y. Pan, Y. Li, T. Mei, Exploring visual relationship for image captioning, in: Proc. Springer Eur. Conf. Comput. Vis. (ECCV), 2018, pp. 711–727.
3. L. Li, S. Tang, L. Deng, Y. Zhang, Q. Tian, Image caption with global-local attention, in: Proc. Conf. Artif. Intell. (AAAI), 2017, pp. 4133–4139.
4. Babytalk: Understanding and generating simple image descriptions;Kulkarni;IEEE Trans. Pattern Anal. Mach. Intell.,2013
5. H. Hu, J. Gu, Z. Zhang, J. Dai, Y. Wei, Relation networks for object detection, in: Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), 2018, pp. 3588–3597.
Cited by
18 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献