Generative adversarial networks in computer vision: image synthesis and manipulation
Author:
Dong Lingfeng1, Huang Yi1, Zou Yuanyang1
Affiliation:
1. 1 Great Wall Cigar Factory of Sichuan China Tobacco Industry Co., Ltd ., Deyang , Sichuan , , China .
Abstract
Abstract
In this paper, we first use the hidden variable loss to generate an adversarial network to optimize the efficiency of the original GAN operation. Secondly, the optimized GAN algorithm is used to train the encoder and decoder to form a new image-processing GAN by combining with the self-encoder and to standardize the operation process of image synthesis. On this basis, the performance of the self-encoder GAN is compared with the original GAN algorithm, and the quality of images generated by the self-encoder GAN and other ways is tested. The test results show that compared with the original GAN framework, the self-encoder GAN has a 50% improvement in operational efficiency in terms of root-mean-square error and an 80% reduction in synthesized global relative error. The self-coder GAN based on this algorithm is 22.5% higher than the traditional BiGAN framework and 2.5% higher than the OGAN framework in terms of FID quality criteria of the generated images and produces 30% less data capacity than BiGAN. The generated images have an average IS score of 3.435, which is superior to other base algorithms.
Publisher
Walter de Gruyter GmbH
Reference18 articles.
1. Krishna, A., Bartake, K., Niu, C., Wang, G., Lai, Y., & Jia, X., et al. (2021). Image synthesis for data augmentation in medical ct using deep reinforcement learning. arXiv e-prints. 2. Shin, M., Seo, Y., Bae, J., Choi, Y. S., Kim, H., & Byun, H., et al. (2023). Ballgan: 3d-aware image synthesis with a spherical background. arXiv e-prints. 3. Zhou, R., Jiang, C., & Xu, Q. (2021). A survey on generative adversarial network-based text-to-image synthesis. Neurocomputing, 451(1). 4. Wang, M., Lang, C., Feng, S., Wang, T., & Li, Y. (2021). Text to photo-realistic image synthesis via chained deep recurrent generative adversarial network. Journal of Visual Communication and Image Representation, 74, 102955. 5. Huang, S., & Chen, Y. (2022). Generative adversarial networks with adaptive semantic normalization for text-to-image synthesis. Digital Signal Processing(120-), 120.
|
|