83
v1v2 (latest)

Robust testing of low-dimensional functions

Abstract

A natural problem in high-dimensional inference is to decide if a classifier f:Rn{1,1}f:\mathbb{R}^n \rightarrow \{-1,1\} depends on a small number of linear directions of its input data. Call a function g:Rn{1,1}g: \mathbb{R}^n \rightarrow \{-1,1\}, a linear kk-junta if it is completely determined by some kk-dimensional subspace of the input space. A recent work of the authors showed that linear kk-juntas are testable. Thus there exists an algorithm to distinguish between: 1. f:Rn{1,1}f: \mathbb{R}^n \rightarrow \{-1,1\} which is a linear kk-junta with surface area ss, 2. ff is ϵ\epsilon-far from any linear kk-junta with surface area (1+ϵ)s(1+\epsilon)s, where the query complexity of the algorithm is independent of the ambient dimension nn. Following the surge of interest in noise-tolerant property testing, in this paper we prove a noise-tolerant (or robust) version of this result. Namely, we give an algorithm which given any c>0c>0, ϵ>0\epsilon>0, distinguishes between 1. f:Rn{1,1}f: \mathbb{R}^n \rightarrow \{-1,1\} has correlation at least cc with some linear kk-junta with surface area ss. 2. ff has correlation at most cϵc-\epsilon with any linear kk-junta with surface area at most ss. The query complexity of our tester is kpoly(s/ϵ)k^{\mathsf{poly}(s/\epsilon)}. Using our techniques, we also obtain a fully noise tolerant tester with the same query complexity for any class C\mathcal{C} of linear kk-juntas with surface area bounded by ss. As a consequence, we obtain a fully noise tolerant tester with query complexity kO(poly(logk/ϵ))k^{O(\mathsf{poly}(\log k/\epsilon))} for the class of intersection of kk-halfspaces (for constant kk) over the Gaussian space. Our query complexity is independent of the ambient dimension nn. Previously, no non-trivial noise tolerant testers were known even for a single halfspace.

View on arXiv
Comments on this paper