Graphs for margins of Bayesian networks

8 August 2014

Abstract

Directed acyclic graph (DAG) models, also called Bayesian networks, impose conditional independence constraints on a multivariate probability distribution, and are widely used in probabilistic reasoning, machine learning and causal inference. If latent variables are included in such a model, then the set of possible marginal distributions over the remaining (observed) variables is generally complex, and not represented by any DAG. Larger classes of graphical models, such as ancestral graphs and acyclic directed mixed graphs (ADMGs), have been introduced to overcome this; however, in this paper we show that these classes of graphs are not rich enough to fully represent the range of models which can arise as margins of DAG models. We introduce a new class of hyper-graphs, called mDAGs, and a latent projection operation to obtain an mDAG from the margin of a DAG. We show that each distinct marginal of a DAG model is represented by at least one mDAG, and provide graphical results towards characterizing when two such marginal models are the same. Finally we show that mDAGs correctly capture the marginal structure of causally-interpreted DAGs under interventions on the observed variables.

View on arXiv

Comments on this paper