DIFFUSION-MODEL-ASSISTED SUPERVISED LEARNING OF GENERATIVE MODELS FOR DENSITY ESTIMATION
Published: 2024
Issue: 1
Volume: 5
Pages: 25-38
ISSN: 2689-3967
Container title: Journal of Machine Learning for Modeling and Computing
Language: en
Short container title: J Mach Learn Model Comput
Authors: Liu Yanfang, Yang Minglei, Zhang Zezhong, Bao Feng, Cao Yanzhao, Zhang Guannan
Abstract
We present a supervised learning framework for training generative models for density estimation.
Generative models, including generative adversarial networks (GANs), normalizing flows, and variational auto-encoders (VAEs), are usually treated as unsupervised learning models because labeled data are typically unavailable for training. Despite the success of these generative models, their unsupervised training suffers from several issues, e.g., the requirement of reversible architectures, vanishing gradients, and training instability. To enable supervised learning in generative models, we utilize the score-based diffusion model to generate labeled data. Unlike existing diffusion models that train neural networks to learn the score function, we develop a training-free score estimation method. This approach uses mini-batch-based Monte Carlo estimators to directly approximate the score function at any spatiotemporal location when solving an ordinary differential equation (ODE) corresponding to the reverse-time stochastic differential equation (SDE). This approach can offer both high accuracy and substantial savings in neural network training time. Once the labeled data are generated, we can train a simple, fully connected neural network to learn the generative model in a supervised manner. Compared with existing normalizing flow models, our method does not require reversible neural networks and avoids computing the Jacobian matrix. Compared with existing diffusion models, our method does not need to solve the reverse-time SDE to generate new samples, so the sampling efficiency is significantly improved. We demonstrate the performance of our method by applying it to a set of 2D datasets as well as real data from the University of California Irvine (UCI) repository.
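The pipeline described in the abstract (training-free mini-batch Monte Carlo score estimation, a reverse-time probability-flow ODE solve to produce labeled pairs, then ordinary supervised regression) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes a variance-preserving forward SDE with beta(t) = 1, a simple Euler integrator, and illustrative batch sizes and step counts that are not taken from the paper.

```python
# Sketch of diffusion-assisted label generation for supervised generator training.
# Assumed forward model: x_t = alpha_t * x_0 + sigma_t * eps with
# alpha_t = exp(-0.5 * t), sigma_t^2 = 1 - alpha_t^2 (VP-SDE, beta = 1).
import numpy as np

def mc_score(x, t, data_batch):
    """Training-free mini-batch Monte Carlo estimate of the score grad_x log p_t(x).

    Uses score(x, t) = -(x - alpha_t * E_w[x_0]) / sigma_t^2, where the weights w are
    proportional to the Gaussian transition density N(x; alpha_t * x_0, sigma_t^2 I)
    evaluated on a mini-batch {x_0} drawn from the training data.
    """
    alpha_t = np.exp(-0.5 * t)
    sigma2_t = 1.0 - alpha_t ** 2
    # Unnormalized log-weights of each mini-batch sample x_0^j given the query point x.
    log_w = -np.sum((x - alpha_t * data_batch) ** 2, axis=1) / (2.0 * sigma2_t)
    w = np.exp(log_w - log_w.max())
    w /= w.sum()
    x0_mean = w @ data_batch                         # weighted mean E_w[x_0]
    return -(x - alpha_t * x0_mean) / sigma2_t

def generate_labeled_pairs(data, n_pairs=500, n_steps=100, batch_size=256, rng=None):
    """Integrate the probability-flow ODE dx/dt = -0.5 * (x + score(x, t)) backward
    from t = 1 to t ~ 0, starting at z ~ N(0, I); each (z, x) pair becomes a label."""
    if rng is None:
        rng = np.random.default_rng(0)
    d = data.shape[1]
    zs = rng.standard_normal((n_pairs, d))           # inputs: Gaussian latent samples
    xs = zs.copy()                                   # states evolved back to the data space
    dt = 1.0 / n_steps
    for k in range(n_steps, 0, -1):
        t = k * dt
        idx = rng.choice(len(data), size=min(batch_size, len(data)), replace=False)
        batch = data[idx]
        for i in range(n_pairs):
            drift = -0.5 * (xs[i] + mc_score(xs[i], t, batch))
            xs[i] = xs[i] - drift * dt               # Euler step backward in time
    return zs, xs                                    # supervised pairs (input z, target x)
```

With the (z, x) pairs produced this way, learning the generator reduces to plain regression: fit a fully connected network F so that F(z) ≈ x under an MSE loss. Drawing new samples is then a single forward pass through F, with no reversible layers, Jacobian computation, or reverse-time SDE solve needed at sampling time.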
Cited by: 3 articles.