106
25

Generalized Determinantal Point Processes: The Linear Case

Abstract

A determinantal point process (DPP) over a universe {1,,m}\{1,\ldots,m\} with respect to an m×mm \times m positive semidefinite matrix LL is a probability distribution where the probability of a subset S{1,,m}S \subseteq \{1,\ldots,m\} is proportional to the determinant of the principal minor of LL corresponding to S.S. DPPs encapsulate a wide variety of known distributions and appear naturally (and surprisingly) in a wide variety of areas such as physics, mathematics and computer science. Several applications that use DPPs rely on the fact that they are computationally tractable -- i.e., there are algorithms for sampling from DPPs efficiently. Recently, there is growing interest in studying a generalization of DPPs in which the support of the distribution is a restricted family B of subsets of {1,2,,m}\{1,2,\ldots, m\}. Mathematically, these distributions, which we call generalized DPPs, include the well-studied hardcore distributions as special cases (when LL is diagonal). In applications, they can be used to refine models based on DPPs by imposing combinatorial constraints on the support of the distribution. In this paper we take first steps in a systematic study of computational questions concerning generalized DPPs. We introduce a natural class of linear families: roughly, a family B is said to be linear if there is a collection of pp linear forms that all elements of B satisfy. Important special cases of linear families are all sets of cardinality kk -- giving rise to kk-DPPs -- and, more generally, partition matroids. On the positive side, we prove that, when pp is a constant, there is an efficient, exact sampling algorithm for linear DPPs. We complement these results by proving that, when pp is large, the computational problem related to such DPPs becomes #\#P-hard. Our proof techniques rely and build on the interplay between polynomials and probability distributions.

View on arXiv
Comments on this paper