A Comparative Study of Generative Adversarial Networks for Text-to-Image Synthesis
Author:
Affiliation:
1. Chandigarh College of Engineering and Technology, India
2. Concordia University, Canada
3. Indraprastha Institute of Information Technology, India
Abstract
Text-to-picture alludes to the conversion of a textual description into a semantically similar image.The automatic synthesis of top-quality pictures from text portrayals is both exciting and useful at the same time.Current AI systems have shown significant advances in the field,but the work is still far from complete. Recent advances in the field of Deep Learning have resulted in the introduction of generative models that are capable of generating realistic images when trained appropriately.In this paper,authors will review the advancements in architectures for solving the problem of image synthesis using a text description.They begin by studying the concepts of the standard GAN, how the DCGAN has been used for the task at hand is followed by the StackGAN with uses a stack of two GANs to generate an image through iterative refinement & StackGAN++ which uses multiple GANs in a tree-like structure making the task of generating images from the text more generalized. They look at the AttnGAN which uses an attentional model to generate sub-regions of an image based on the description.
Publisher
IGI Global
Subject
Pharmacology (medical)
Reference28 articles.
1. Barratt & Sharma. (2018). A Note on the Inception Score. arXiv, 1801.01973.
2. The laplacian pyramid as a compact image code.;P.Burt;IEEE Transactions on Communications,1983
3. Denton, E., Chintala, S., Szlam, A., & Fergus, R. (2015a). Deep generative image models using a laplacian pyramid of adversarial networks. CoRR, arXiv, 1506.05751.
4. Elakkiya, Vijayakumar, & Kumar. (2021). An optimized Generative Adversarial Network based continuous sign language classification. Expert Systems with Applications, 182. https://doi.org/10.1016/j.eswa.2021.115276
5. Elakkiya, R., Vijayakumar, P., & Karuppiah, M. (2021). COVID_SCREENET: COVID-19 Screening in Chest Radiography Images Using Deep Transfer Stacking. Information Systems Frontiers: A Journal of Research and Innovation, 1–15. Advance online publication. https://doi.org/10.1007/s10796-021-10123-x
Cited by 25 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Semantic Web Techniques for Extracting and Analyzing of Cropland Abandonment in Hilly Areas;International Journal on Semantic Web and Information Systems;2024-07-29
2. A Review of Semantic Medical Image Segmentation Based on Different Paradigms;International Journal on Semantic Web and Information Systems;2024-06-06
3. Reversible-Prior-Based Spectral-Spatial Transformer for Efficient Hyperspectral Image Reconstruction;International Journal on Semantic Web and Information Systems;2024-05-31
4. Cloud-Based Image Segmentation Approach for Internet of Drones (IoD);IEEE INFOCOM 2024 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS);2024-05-20
5. Enhancing Music Generation With a Semantic-Based Sequence-to-Music Transformer Framework;International Journal on Semantic Web and Information Systems;2024-05-16
1.学者识别学者识别
2.学术分析学术分析
3.人才评估人才评估
"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370
www.globalauthorid.com
TOP
Copyright © 2019-2024 北京同舟云网络信息技术有限公司 京公网安备11010802033243号 京ICP备18003416号-3