398
v1v2 (latest)

Provable ICA with Unknown Gaussian Noise, and Implications for Gaussian Mixtures and Autoencoders

Abstract

We present a new algorithm for Independent Component Analysis (ICA) which has provable performance guarantees. In particular, suppose we are given samples of the form y=Ax+ηy = Ax + \eta where AA is an unknown n×nn \times n matrix and xx is a random variable whose components are independent and have a fourth moment strictly less than that of a standard Gaussian random variable and η\eta is an nn-dimensional Gaussian random variable with unknown covariance Σ\Sigma: We give an algorithm that provable recovers AA and Σ\Sigma up to an additive ϵ\epsilon and whose running time and sample complexity are polynomial in nn and 1/ϵ1 / \epsilon. To accomplish this, we introduce a novel "quasi-whitening" step that may be useful in other contexts in which the covariance of Gaussian noise is not known in advance. We also give a general framework for finding all local optima of a function (given an oracle for approximately finding just one) and this is a crucial step in our algorithm, one that has been overlooked in previous attempts, and allows us to control the accumulation of error when we find the columns of AA one by one via local search.

View on arXiv
Comments on this paper