Approximation and sampling of multivariate probability distributions in the tensor train decomposition-Reference-Cited by-同舟云学术

Approximation and sampling of multivariate probability distributions in the tensor train decomposition

Published:2019-11-02 Issue:3 Volume:30 Page:603-625
ISSN:0960-3174
Container-title:Statistics and Computing
language:en
Short-container-title:Stat Comput

Author:

Dolgov Sergey,Anaya-Izquierdo Karim,Fox Colin,Scheichl Robert

Abstract

Abstract General multivariate distributions are notoriously expensive to sample from, particularly the high-dimensional posterior distributions in PDE-constrained inverse problems. This paper develops a sampler for arbitrary continuous multivariate distributions that is based on low-rank surrogates in the tensor train format, a methodology that has been exploited for many years for scalable, high-dimensional density function approximation in quantum physics and chemistry. We build upon recent developments of the cross approximation algorithms in linear algebra to construct a tensor train approximation to the target probability density function using a small number of function evaluations. For sufficiently smooth distributions, the storage required for accurate tensor train approximations is moderate, scaling linearly with dimension. In turn, the structure of the tensor train surrogate allows sampling by an efficient conditional distribution method since marginal distributions are computable with linear complexity in dimension. Expected values of non-smooth quantities of interest, with respect to the surrogate distribution, can be estimated using transformed independent uniformly-random seeds that provide Monte Carlo quadrature or transformed points from a quasi-Monte Carlo lattice to give more efficient quasi-Monte Carlo quadrature. Unbiased estimates may be calculated by correcting the transformed random seeds using a Metropolis–Hastings accept/reject step, while the quasi-Monte Carlo quadrature may be corrected either by a control-variate strategy or by importance weighting. We show that the error in the tensor train approximation propagates linearly into the Metropolis–Hastings rejection rate and the integrated autocorrelation time of the resulting Markov chain; thus, the integrated autocorrelation time may be made arbitrarily close to 1, implying that, asymptotic in sample size, the cost per effectively independent sample is one target density evaluation plus the cheap tensor train surrogate proposal that has linear cost with dimension. These methods are demonstrated in three computed examples: fitting failure time of shock absorbers; a PDE-constrained inverse diffusion problem; and sampling from the Rosenbrock distribution. The delayed rejection adaptive Metropolis (DRAM) algorithm is used as a benchmark. In all computed examples, the importance weight-corrected quasi-Monte Carlo quadrature performs best and is more efficient than DRAM by orders of magnitude across a wide range of approximation accuracies and sample sizes. Indeed, all the methods developed here significantly outperform DRAM in all computed examples.

Funder

Engineering and Physical Sciences Research Council

Publisher

Springer Science and Business Media LLC

Subject

Computational Theory and Mathematics,Statistics, Probability and Uncertainty,Statistics and Probability,Theoretical Computer Science

Link

http://link.springer.com/content/pdf/10.1007/s11222-019-09910-z.pdf

Reference54 articles.

1. Atchadé, Y.F.: An adaptive version for the metropolis adjusted langevin algorithm with a truncated drift. Methodol. Comput. Appl. Probab. 8(2), 235–254 (2006). https://doi.org/10.1007/s11009-006-8550-0

2. Ballani, J., Grasedyck, L.: Hierarchical tensor approximation of output quantities of parameter-dependent PDEs. SIAM/ASA J. Uncertain. Quantif. 3(1), 852–872 (2015)

3. Brooks, S., Gelman, A., Jones, G., Meng, X.L. (eds.): Handbook of Markov Chain Monte Carlo. CRC Press, Boca Raton (2011)

4. Christen, J., Fox, C.: A general purpose sampling algorithm for continuous distributions (the t-walk). Bayesian Anal. 5(2), 263–282 (2010)

5. Devroye, L.: Non-Uniform Random Variate Generation. Springer, Berlin (1986)

Cited by 30 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Taming numerical imprecision by adapting the KL divergence to negative probabilities;2024-02-06

2. Probabilistic tensor optimization of quantum circuits for the max−k−cut problem;Physical Review A;2024-01-31

3. Deep Importance Sampling Using Tensor Trains with Application to a Priori and a Posteriori Rare Events;SIAM Journal on Scientific Computing;2024-01-24

4. Tensor train for global optimization problems in robotics;The International Journal of Robotics Research;2023-11-30

5. Generative modeling via tensor train sketching;Applied and Computational Harmonic Analysis;2023-11