Exploring the Effect of High-frequency Components in GANs Training


Li Ziqiang1, Xia Pengfei1, Rui Xue1, Li Bin1


1. University of Science and Technology of China, Hefei, Anhui, China


Generative Adversarial Networks (GANs) can generate images that are visually indistinguishable from real images. However, recent studies have revealed that generated and real images differ significantly in the frequency domain. In this article, we argue that this frequency gap is caused by the high-frequency sensitivity of the discriminator. According to our observations, during the training of most GANs, severe high-frequency differences make the discriminator focus excessively on high-frequency components, which hinders the generator from fitting the low-frequency components that are important for learning image content. We then propose two simple yet effective image pre-processing operations in the frequency domain that eliminate the side effects caused by high-frequency differences in GAN training: High-frequency Confusion (HFC) and High-frequency Filter (HFF). The proposed operations are general and can be applied to most existing GANs at little additional cost. We verify their effectiveness across multiple loss functions, network architectures, and datasets. Specifically, the proposed HFF achieves FID improvements of 42.5% on CelebA (128×128) unconditional generation based on SNGAN, 30.2% on CelebA unconditional generation based on SSGAN, and 69.3% on CelebA unconditional generation based on InfoMAXGAN. Furthermore, we adopt HFF as the first attempt at data augmentation in the frequency domain for contrastive learning, achieving state-of-the-art performance on unconditional generation. Code is available at https://github.com/iceli1007/HFC-and-HFF.
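
The abstract names the two operations but does not specify the filter shape, the frequency threshold, or exactly which images are modified. The following is only a minimal sketch of one plausible reading, not the authors' implementation (see the linked repository for that). It assumes an ideal circular low-pass mask applied through a 2-D FFT, single-channel images, and a hypothetical `radius` parameter; the function names are illustrative as well. Under these assumptions, HFF keeps only the low-frequency part of an image, while HFC combines the low-frequency part of a generated image with the high-frequency part of a real one.

```python
import numpy as np


def radial_mask(height, width, radius):
    """Boolean mask, True within `radius` of the centre of a shifted spectrum."""
    ys, xs = np.ogrid[:height, :width]
    cy, cx = height // 2, width // 2  # fftshift places the DC term here
    return (ys - cy) ** 2 + (xs - cx) ** 2 <= radius ** 2


def split_frequencies(image, radius):
    """Split a 2-D image into (low, high) frequency parts that sum to the image."""
    spectrum = np.fft.fftshift(np.fft.fft2(image))
    mask = radial_mask(*image.shape, radius)
    low = np.fft.ifft2(np.fft.ifftshift(spectrum * mask)).real
    high = np.fft.ifft2(np.fft.ifftshift(spectrum * ~mask)).real
    return low, high


def high_frequency_filter(image, radius):
    """Assumed HFF: discard everything outside the low-frequency disc."""
    low, _ = split_frequencies(image, radius)
    return low


def high_frequency_confusion(fake, real, radius):
    """Assumed HFC: low frequencies of `fake` plus high frequencies of `real`."""
    fake_low, _ = split_frequencies(fake, radius)
    _, real_high = split_frequencies(real, radius)
    return fake_low + real_high
```

In a training loop, a filter of this kind would presumably be applied to the images before they reach the discriminator, so that high-frequency differences cannot dominate its decision. For RGB images the same functions could be applied to each channel separately.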


National Natural Science Foundation of China


Association for Computing Machinery (ACM)


Computer Networks and Communications, Hardware and Architecture

