176
v1v2 (latest)

Detection of Correlated Random Vectors

International Symposium on Information Theory (ISIT), 2024
Abstract

In this paper, we investigate the problem of deciding whether two standard normal random vectors XRn\mathsf{X}\in\mathbb{R}^{n} and YRn\mathsf{Y}\in\mathbb{R}^{n} are correlated or not. This is formulated as a hypothesis testing problem, where under the null hypothesis, these vectors are statistically independent, while under the alternative, X\mathsf{X} and a randomly and uniformly permuted version of Y\mathsf{Y}, are correlated with correlation ρ\rho. We analyze the thresholds at which optimal testing is information-theoretically impossible and possible, as a function of nn and ρ\rho. To derive our information-theoretic lower bounds, we develop a novel technique for evaluating the second moment of the likelihood ratio using an orthogonal polynomials expansion, which among other things, reveals a surprising connection to integer partition functions. We also study a multi-dimensional generalization of the above setting, where rather than two vectors we observe two databases/matrices, and furthermore allow for partial correlations between these two.

View on arXiv
Comments on this paper