Author:
Biroli Giulio,Mézard Marc
Abstract
Abstract
Generative models based on diffusion have become the state of the art in the last few years, notably for image generation. Here, we analyze them in the high-dimensional limit, where data are formed by a very large number of variables. We use methods from statistical physics and focus on two well-controlled high-dimensional cases: a Gaussian model and the Curie–Weiss model of ferromagnetism. In the latter case, we highlight the mechanism of symmetry breaking in the inverse diffusion, and point out that, in order to reconstruct the relative asymmetry of the two low-temperature states, and thus to obtain the correct probability weights, one needs a database with a number of points much larger than the dimension of each data point. We characterize the scaling laws in the number of data and in the number of dimensions for an efficient generation.
Subject
Statistics, Probability and Uncertainty,Statistics and Probability,Statistical and Nonlinear Physics
Reference16 articles.
1. Reverse-time diffusion equation models;Anderson;Stoch. Process. Appl.,1982
2. Eigenvectors distribution and quantum unique ergodicity for deformed Wigner matrices;Benigni;Ann. Inst. H. Poincare Probab. Stat.,2020
3. Convergence of denoising diffusion models under the manifold hypothesis;De Bortoli,2022
4. Diffusion Schrödinger bridge with applications to score-based generative modeling;De Bortoli,2021
5. The total variation distance between high-dimensional Gaussians with the same mean;Devroye,2018
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献