20
0

A Skewness-Based Criterion for Addressing Heteroscedastic Noise in Causal Discovery

Abstract

Real-world data often violates the equal-variance assumption (homoscedasticity), making it essential to account for heteroscedastic noise in causal discovery. In this work, we explore heteroscedastic symmetric noise models (HSNMs), where the effect YY is modeled as Y=f(X)+σ(X)NY = f(X) + \sigma(X)N, with XX as the cause and NN as independent noise following a symmetric distribution. We introduce a novel criterion for identifying HSNMs based on the skewness of the score (i.e., the gradient of the log density) of the data distribution. This criterion establishes a computationally tractable measurement that is zero in the causal direction but nonzero in the anticausal direction, enabling the causal direction discovery. We extend this skewness-based criterion to the multivariate setting and propose SkewScore, an algorithm that handles heteroscedastic noise without requiring the extraction of exogenous noise. We also conduct a case study on the robustness of SkewScore in a bivariate model with a latent confounder, providing theoretical insights into its performance. Empirical studies further validate the effectiveness of the proposed method.

View on arXiv
@article{lin2025_2410.06407,
  title={ A Skewness-Based Criterion for Addressing Heteroscedastic Noise in Causal Discovery },
  author={ Yingyu Lin and Yuxing Huang and Wenqin Liu and Haoran Deng and Ignavier Ng and Kun Zhang and Mingming Gong and Yi-An Ma and Biwei Huang },
  journal={arXiv preprint arXiv:2410.06407},
  year={ 2025 }
}
Comments on this paper