83
2

Dimension reduction as an optimization problem over a set of generalized functions

Abstract

Classical dimension reduction problem can be loosely formulated as a problem of finding a kk-dimensional affine subspace of Rn{\mathbb R}^n onto which data points x1,,xN{\mathbf x}_1,\cdots, {\mathbf x}_N can be projected without loss of valuable information. We reformulate this problem in the language of tempered distributions, i.e. as a problem of approximating an empirical probability density function pemp(x)=1Ni=1Nδn(xxi)p_{\rm{emp}}({\mathbf x}) = \frac{1}{N} \sum_{i=1}^N \delta^n (\bold{x} - \bold{x}_i), where δn\delta^n is an nn-dimensional Dirac delta function, by another tempered distribution q(x)q({\mathbf x}) whose density is supported in some kk-dimensional subspace. Thus, our problem is reduced to the minimization of a certain loss function I(q)I(q) measuring the distance from qq to pempp_{\rm{emp}} over a pertinent set of generalized functions, denoted Gk\mathcal{G}_k. Another classical problem of data analysis is the sufficient dimension reduction problem. We show that it can be reduced to the following problem: given a function f:RnRf: {\mathbb R}^n\rightarrow {\mathbb R} and a probability density function p(x)p({\mathbf x}), find a function of the form g(w1Tx,,wkTx)g({\mathbf w}^T_1{\mathbf x}, \cdots, {\mathbf w}^T_k{\mathbf x}) that minimizes the loss Expf(x)g(w1Tx,,wkTx)2{\mathbb E}_{{\mathbf x}\sim p} |f({\mathbf x})-g({\mathbf w}^T_1{\mathbf x}, \cdots, {\mathbf w}^T_k{\mathbf x})|^2. We first show that search spaces of the latter two problems are in one-to-one correspondence which is defined by the Fourier transform. We introduce a nonnegative penalty function R(f)R(f) and a set of ordinary functions Ωϵ={fR(f)ϵ}\Omega_\epsilon = \{f| R(f)\leq \epsilon\} in such a way that Ωϵ\Omega_\epsilon `approximates' the space Gk\mathcal{G}_k when ϵ0\epsilon \rightarrow 0. Then we present an algorithm for minimization of I(f)+λR(f)I(f)+\lambda R(f), based on the idea of two-step iterative computation.

View on arXiv
Comments on this paper