Perturbative construction of mean-field equations in extensive-rank matrix factorization and denoising-Reference-Cited by-同舟云学术

Perturbative construction of mean-field equations in extensive-rank matrix factorization and denoising

Published:2022-08-01 Issue:8 Volume:2022 Page:083301
ISSN:1742-5468
Container-title:Journal of Statistical Mechanics: Theory and Experiment
language:
Short-container-title:J. Stat. Mech.

Author:

Maillard Antoine,Krzakala Florent,Mézard Marc,Zdeborová Lenka

Abstract

Abstract Factorization of matrices where the rank of the two factors diverges linearly with their sizes has many applications in diverse areas such as unsupervised representation learning, dictionary learning or sparse coding. We consider a setting where the two factors are generated from known component-wise independent prior distributions, and the statistician observes a (possibly noisy) component-wise function of their matrix product. In the limit where the dimensions of the matrices tend to infinity, but their ratios remain fixed, we expect to be able to derive closed form expressions for the optimal mean squared error on the estimation of the two factors. However, this remains a very involved mathematical and algorithmic problem. A related, but simpler, problem is extensive-rank matrix denoising, where one aims to reconstruct a matrix with extensive but usually small rank from noisy measurements. In this paper, we approach both these problems using high-temperature expansions at fixed order parameters. This allows to clarify how previous attempts at solving these problems failed at finding an asymptotically exact solution. We provide a systematic way to derive the corrections to these existing approximations, taking into account the structure of correlations particular to the problem. Finally, we illustrate our approach in detail on the case of extensive-rank matrix denoising. We compare our results with known optimal rotationally-invariant estimators, and show how exact asymptotic calculations of the minimal error can be performed using extensive-rank matrix integrals.

Publisher

IOP Publishing

Subject

Statistics, Probability and Uncertainty,Statistics and Probability,Statistical and Nonlinear Physics

Link

https://iopscience.iop.org/article/10.1088/1742-5468/ac7e4c/pdf

Reference74 articles.

1. Rotational invariant estimator for general noisy matrices;Bun;IEEE Trans. Inf. Theory,2016

2. A blind source separation technique using second-order statistics;Belouchrani;IEEE Trans. Signal Process.,1997

3. Phase transition of the largest eigenvalue for nonnull complex sample covariance matrices;Baik;Ann. Probab.,2005

4. Instanton approach to large n Harish–Chandra–Itzykson–Zuber integrals;Bun;Phys. Rev. Lett.,2014

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The decimation scheme for symmetric matrix factorization;Journal of Physics A: Mathematical and Theoretical;2024-02-12

2. Singular vectors of sums of rectangular random matrices and optimal estimation of high-rank signals: The extensive spike model;Physical Review E;2023-11-20

3. Matrix factorization with neural networks;Physical Review E;2023-06-27

4. Rectangular Rotational Invariant Estimator for General Additive Noise Matrices;2023 IEEE International Symposium on Information Theory (ISIT);2023-06-25

5. Gradient flow on extensive-rank positive semi-definite matrix denoising;2023 IEEE Information Theory Workshop (ITW);2023-04-23