Unsupervised Image Translation Using Multi-Scale Residual GAN-Reference-Cited by-同舟云学术

Unsupervised Image Translation Using Multi-Scale Residual GAN

Published:2022-11-19 Issue:22 Volume:10 Page:4347
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Zhang Yifei^ORCID,Li Weipeng,Wang Daling^ORCID,Feng Shi

Abstract

Image translation is a classic problem of image processing and computer vision for transforming an image from one domain to another by learning the mapping between an input image and an output image. A novel Multi-scale Residual Generative Adversarial Network (MRGAN) based on unsupervised learning is proposed in this paper for transforming images between different domains using unpaired data. In the model, a dual generater architecture is used to eliminate the dependence on paired training samples and introduce a multi-scale layered residual network in generators for reducing semantic loss of images in the process of encoding. The Wasserstein GAN architecture with gradient penalty (WGAN-GP) is employed in the discriminator to optimize the training process and speed up the network convergence. Comparative experiments on several image translation tasks over style transfers and object migrations show that the proposed MRGAN outperforms strong baseline models by large margins.

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/10/22/4347/pdf

Reference47 articles.

1. Luan, F.J., Paris, S., Shechtman, E., and Bala, K. (2017, January 21–26). Deep photo style transfer. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.

2. Dumoulin, V., Shlens, J., and Kudlur, M. (2017, January 24–26). A Learned Representation For Artistic Style. Proceedings of the 5th International Conference on Learning Representations (ICLR)—Conference Track Proceedings, Toulon, France.

3. Johnson, J., Alahi, A., and Li, F.F. (2016, January 8–16). Perceptual Losses for Real-Time Style Transfer and Super-Resolution. Proceedings of the European Conference of Computer Vision (ECCV), Amsterdam, The Netherlands.

4. Isola, P., Zhu, J.-Y., Zhou, T.H., and Efros, A.A. (2017, January 21–26). Image-to-Image Translation with Conditional Adversarial Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.

5. Zhu, J.-Y., Park, T., Isola, P., and Efros, A.A. (2017, January 22–29). Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. Proceedings of the International Conference on Computer Vision (ICCV), Venice, Italy.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Zoom-GAN: learn to colorize multi-scale targets;The Visual Computer;2023-07-07