Sketching for large-scale learning of mixture models-Reference-Cited by-同舟云学术

Sketching for large-scale learning of mixture models

Published:2017-12-22 Issue:3 Volume:7 Page:447-508
ISSN:2049-8764
Container-title:Information and Inference: A Journal of the IMA
language:en
Short-container-title:

Author:

Keriven Nicolas¹,Bourrier Anthony²,Gribonval Rémi¹,Pérez Patrick³

Affiliation:

1. Univ Rennes, Inria, CNRS, IRISA Campus de Beaulieu, Rennes Cedex, France

2. Gipsa-Lab, 11 rue des Mathématiques, Saint-Martin-d’Hères, France

3. Technicolor, 975 Avenue des Champs Blancs, Cesson Sévigné, France

Abstract

Abstract Learning parameters from voluminous data can be prohibitive in terms of memory and computational requirements. We propose a ‘compressive learning’ framework, where we estimate model parameters from a sketch of the training data. This sketch is a collection of generalized moments of the underlying probability distribution of the data. It can be computed in a single pass on the training set and is easily computable on streams or distributed datasets. The proposed framework shares similarities with compressive sensing, which aims at drastically reducing the dimension of high-dimensional signals while preserving the ability to reconstruct them. To perform the estimation task, we derive an iterative algorithm analogous to sparse reconstruction algorithms in the context of linear inverse problems. We exemplify our framework with the compressive estimation of a Gaussian mixture model (GMM), providing heuristics on the choice of the sketching procedure and theoretical guarantees of reconstruction. We experimentally show on synthetic data that the proposed algorithm yields results comparable to the classical expectation-maximization technique while requiring significantly less memory and fewer computations when the number of database elements is large. We further demonstrate the potential of the approach on real large-scale data (over $10^{8}$ training samples) for the task of model-based speaker verification. Finally, we draw some connections between the proposed framework and approximate Hilbert space embedding of probability distributions using random features. We show that the proposed sketching operator can be seen as an innovative method to design translation-invariant kernels adapted to the analysis of GMMs. We also use this theoretical framework to derive preliminary information preservation guarantees, in the spirit of infinite-dimensional compressive sensing.

Funder

European Research Council

Publisher

Oxford University Press (OUP)

Subject

Applied Mathematics,Computational Theory and Mathematics,Numerical Analysis,Statistics and Probability,Analysis

Link

http://academic.oup.com/imaiai/article-pdf/7/3/447/32468246/iax015.pdf

Reference113 articles.

1. Database-friendly random projections. Principles of Database Systems (PODS).;Achlioptas,2001

2. The Multivariate Gaussian Probability Distribution.;Ahrendt,2005

3. Theory of reproducing kernels.;Aronzajn;Trans. Amer. Math. Soc.,1950