Least kth-Order and Rényi Generative Adversarial Networks

Author:

Bhatia Himesh1,Paul William2,Alajaji Fady3,Gharesifard Bahman4,Burlina Philippe5

Affiliation:

1. Department of Mathematics and Statistics, Queens University, ON K7L 3N6, Canada himesh.bhatia@queensu.ca

2. Johns Hopkins University Applied Physics Laboratory, Laurel, MD 20723, U.S.A. william.paul@jhuapl.edu

3. Department of Mathematics and Statistics, Queens University, ON K7L 3N6, Canada fa@queensu.ca

4. Department of Mathematics and Statistics, Queens University, ON K7L 3N6, Canada bahman.gharesifard@queensu.ca

5. Johns Hopkins University Applied Physics Laboratory, Laurel, MD 20723, and Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, U.S.A. philippe.burlina@jhuapl.edu

Abstract

Abstract We investigate the use of parameterized families of information-theoretic measures to generalize the loss functions of generative adversarial networks (GANs) with the objective of improving performance. A new generator loss function, least kth-order GAN (LkGAN), is introduced, generalizing the least squares GANs (LSGANs) by using a kth-order absolute error distortion measure with k≥1 (which recovers the LSGAN loss function when k=2). It is shown that minimizing this generalized loss function under an (unconstrained) optimal discriminator is equivalent to minimizing the kth-order Pearson-Vajda divergence. Another novel GAN generator loss function is next proposed in terms of Rényi cross-entropy functionals with order α>0, α≠1. It is demonstrated that this Rényi-centric generalized loss function, which provably reduces to the original GAN loss function as α→1, preserves the equilibrium point satisfied by the original GAN based on the Jensen-Rényi divergence, a natural extension of the Jensen-Shannon divergence. Experimental results indicate that the proposed loss functions, applied to the MNIST and CelebA data sets, under both DCGAN and StyleGAN architectures, confer performance benefits by virtue of the extra degrees of freedom provided by the parameters k and α, respectively. More specifically, experiments show improvements with regard to the quality of the generated images as measured by the Fréchet inception distance score and training stability. While it was applied to GANs in this study, the proposed approach is generic and can be used in other applications of information theory to deep learning, for example, the issues of fairness or privacy in artificial intelligence.

Publisher

MIT Press - Journals

Subject

Cognitive Neuroscience,Arts and Humanities (miscellaneous)

Reference61 articles.

1. Csiszár's cutoff rates for the general hypothesis testing problem;Alajaji;IEEE Transactions on Information Theory,2004

2. Deep variational information bottleneck;Alemi;Proceedings of the 5th International Conference on Learning Representations,2017

3. An inequality on guessing and its applications to sequential decoding;Arikan;IEEE Transactions on Information Theory,1996

Cited by 10 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. A Unifying Generator Loss Function for Generative Adversarial Networks;Entropy;2024-03-27

2. Classification Utility, Fairness, and Compactness via Tunable Information Bottleneck and Rényi Measures;IEEE Transactions on Information Forensics and Security;2024

3. Addressing GAN Training Instabilities via Tunable Classification Losses;IEEE Journal on Selected Areas in Information Theory;2024

4. Fisher information and its extensions based on infinite mixture density functions;Physica A: Statistical Mechanics and its Applications;2023-08

5. D, αG)-GANs: Addressing GAN Training Instabilities via Dual Objectives;2023 IEEE International Symposium on Information Theory (ISIT);2023-06-25

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3