58
0

Addressing pitfalls in implicit unobserved confounding synthesis using explicit block hierarchical ancestral sampling

Abstract

Unbiased data synthesis is crucial for evaluating causal discovery algorithms in the presence of unobserved confounding, given the scarcity of real-world datasets. A common approach, implicit parameterization, encodes unobserved confounding by modifying the off-diagonal entries of the idiosyncratic covariance matrix while preserving positive definiteness. Within this approach, we identify that state-of-the-art protocols have two distinct issues that hinder unbiased sampling from the complete space of causal models: first, we give a detailed analysis of use of diagonally dominant constructions restricts the spectrum of partial correlation matrices; and second, the restriction of possible graphical structures when sampling bidirected edges, unnecessarily ruling out valid causal models. To address these limitations, we propose an improved explicit modeling approach for unobserved confounding, leveraging block-hierarchical ancestral generation of ground truth causal graphs. Algorithms for converting the ground truth DAG into ancestral graph is provided so that the output of causal discovery algorithms could be compared with. We draw connections between implicit and explicit parameterization, prove that our approach fully covers the space of causal models, including those generated by the implicit parameterization, thus enabling more robust evaluation of methods for causal discovery and inference.

View on arXiv
@article{sun2025_2503.09194,
  title={ Addressing pitfalls in implicit unobserved confounding synthesis using explicit block hierarchical ancestral sampling },
  author={ Xudong Sun and Alex Markham and Pratik Misra and Carsten Marr },
  journal={arXiv preprint arXiv:2503.09194},
  year={ 2025 }
}
Comments on this paper