Cross-modal Feature Alignment based Hybrid Attentional Generative Adversarial Networks for text-to-image synthesis-Reference-Cited by-同舟云学术

Cross-modal Feature Alignment based Hybrid Attentional Generative Adversarial Networks for text-to-image synthesis

Published:2020-12 Issue: Volume:107 Page:102866
ISSN:1051-2004
Container-title:Digital Signal Processing
language:en
Short-container-title:Digital Signal Processing

Author:

Cheng Qingrong,Gu Xiaodong

Funder

National Natural Science Foundation of China

Publisher

Elsevier BV

Subject

Electrical and Electronic Engineering,Signal Processing,Artificial Intelligence,Applied Mathematics,Computer Vision and Pattern Recognition,Statistics, Probability and Uncertainty,Computational Theory and Mathematics

Reference48 articles.

1. Show, attend and tell: neural image caption generation with visual attention;Xu,2015

2. Image caption with global-local attention;Li,2017

3. VQA: visual question answering;Antol,2017

4. Making the V in VQA matter: elevating the role of image understanding in visual question answering;Goyal,2017

5. Adversarial cross-modal retrieval;Wang,2017

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Adaptive multi-text union for stable text-to-image synthesis learning;Pattern Recognition;2024-08

2. Big Data and AI-Driven Product Design: A Survey;Applied Sciences;2023-08-20

3. Modified GAN with Proposed Feature Set for Text-to-Image Synthesis;International Journal of Pattern Recognition and Artificial Intelligence;2023-03-09

4. Optimized GAN for Text-to-Image Synthesis: Hybrid Whale Optimization Algorithm and Dragonfly Algorithm;Advances in Engineering Software;2022-11

5. Local-global visual interaction attention for image captioning;Digital Signal Processing;2022-10