A Skewness-Based Criterion for Addressing Heteroscedastic Noise in Causal Discovery

Real-world data often violates the equal-variance assumption (homoscedasticity), making it essential to account for heteroscedastic noise in causal discovery. In this work, we explore heteroscedastic symmetric noise models (HSNMs), where the effect is modeled as , with as the cause and as independent noise following a symmetric distribution. We introduce a novel criterion for identifying HSNMs based on the skewness of the score (i.e., the gradient of the log density) of the data distribution. This criterion establishes a computationally tractable measurement that is zero in the causal direction but nonzero in the anticausal direction, enabling the causal direction discovery. We extend this skewness-based criterion to the multivariate setting and propose SkewScore, an algorithm that handles heteroscedastic noise without requiring the extraction of exogenous noise. We also conduct a case study on the robustness of SkewScore in a bivariate model with a latent confounder, providing theoretical insights into its performance. Empirical studies further validate the effectiveness of the proposed method.
View on arXiv@article{lin2025_2410.06407, title={ A Skewness-Based Criterion for Addressing Heteroscedastic Noise in Causal Discovery }, author={ Yingyu Lin and Yuxing Huang and Wenqin Liu and Haoran Deng and Ignavier Ng and Kun Zhang and Mingming Gong and Yi-An Ma and Biwei Huang }, journal={arXiv preprint arXiv:2410.06407}, year={ 2025 } }