BEGAN v3: Avoiding Mode Collapse in GANs Using Variational Inference-Reference-Cited by-同舟云学术

BEGAN v3: Avoiding Mode Collapse in GANs Using Variational Inference

Published:2020-04-23 Issue:4 Volume:9 Page:688
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Park Sung-Wook,Huh Jun-Ho,Kim Jong-Chan^ORCID

Abstract

In the field of deep learning, the generative model did not attract much attention until GANs (generative adversarial networks) appeared. In 2014, Google’s Ian Goodfellow proposed a generative model called GANs. GANs use different structures and objective functions from the existing generative model. For example, GANs use two neural networks: a generator that creates a realistic image, and a discriminator that distinguishes whether the input is real or synthetic. If there are no problems in the training process, GANs can generate images that are difficult even for experts to distinguish in terms of authenticity. Currently, GANs are the most researched subject in the field of computer vision, which deals with the technology of image style translation, synthesis, and generation, and various models have been unveiled. The issues raised are also improving one by one. In image synthesis, BEGAN (Boundary Equilibrium Generative Adversarial Network), which outperforms the previously announced GANs, learns the latent space of the image, while balancing the generator and discriminator. Nonetheless, BEGAN also has a mode collapse wherein the generator generates only a few images or a single one. Although BEGAN-CS (Boundary Equilibrium Generative Adversarial Network with Constrained Space), which was improved in terms of loss function, was introduced, it did not solve the mode collapse. The discriminator structure of BEGAN-CS is AE (AutoEncoder), which cannot create a particularly useful or structured latent space. Compression performance is not good either. In this paper, this characteristic of AE is considered to be related to the occurrence of mode collapse. Thus, we used VAE (Variational AutoEncoder), which added statistical techniques to AE. As a result of the experiment, the proposed model did not cause mode collapse but converged to a better state than BEGAN-CS.

Funder

National Research Foundation

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/9/4/688/pdf

Reference46 articles.

1. Deep Learning;Yann;Nature,2015

2. Conditional Generative Adversarial Nets;Mirza,2014

3. Backpropagation: The basic theory;Rumelhart,1995

Cited by 24 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. FabricGAN: an enhanced generative adversarial network for data augmentation and improved fabric defect detection;Textile Research Journal;2024-03-15

2. Multi-solution inverse design in photonics using generative modeling;Journal of the Optical Society of America B;2024-01-30

3. Deep clustering techniques based on generative architectures;Unsupervised and Semi-Supervised Learning;2023-12-22

4. Enhancement for Greenhouse Sustainability Using Tomato Disease Image Classification System Based on Intelligent Complex Controller;Sustainability;2023-11-22

5. Mssgan: Enforcing Multiple Generators to Learn Multiple Subspaces to Avoid the Mode Collapse;Machine Learning and Knowledge Extraction;2023-10-10