Generating knockoffs via conditional independence
- UQCV
Let be a -variate random vector and a knockoff copy of (in the sense of \cite{CFJL18}). A new approach for constructing (henceforth, NA) has been introduced in \cite{JSPI}. NA has essentially three advantages: (i) To build is straightforward; (ii) The joint distribution of can be written in closed form; (iii) is often optimal under various criteria, including mean absolute correlation and reconstructability. However, for NA to apply, the distribution of needs to be of the form for some random element . Our first result is that any probability measure on can be approximated by a probability measure which makes condition (*) true. If is absolutely continuous, the approximation holds in total variation distance. In applications, regarding as the distribution of , this result suggests using the knockoffs based on instead of those based on (which are generally unknown). Our second result is a characterization of the pairs where is obtained via NA. It turns out that is of this type if and only if it can be extended to an infinite sequence so as to satisfy certain invariance conditions. The basic tool for proving this fact is de Finetti's theorem for partially exchangeable sequences.
View on arXiv