ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1707.04673
18
84

Learning linear structural equation models in polynomial time and sample complexity

15 July 2017
Asish Ghoshal
Jean Honorio
    CML
ArXivPDFHTML
Abstract

The problem of learning structural equation models (SEMs) from data is a fundamental problem in causal inference. We develop a new algorithm --- which is computationally and statistically efficient and works in the high-dimensional regime --- for learning linear SEMs from purely observational data with arbitrary noise distribution. We consider three aspects of the problem: identifiability, computational efficiency, and statistical efficiency. We show that when data is generated from a linear SEM over ppp nodes and maximum degree ddd, our algorithm recovers the directed acyclic graph (DAG) structure of the SEM under an identifiability condition that is more general than those considered in the literature, and without faithfulness assumptions. In the population setting, our algorithm recovers the DAG structure in O(p(d2+log⁡p))\mathcal{O}(p(d^2 + \log p))O(p(d2+logp)) operations. In the finite sample setting, if the estimated precision matrix is sparse, our algorithm has a smoothed complexity of O~(p3+pd7)\widetilde{\mathcal{O}}(p^3 + pd^7)O(p3+pd7), while if the estimated precision matrix is dense, our algorithm has a smoothed complexity of O~(p5)\widetilde{\mathcal{O}}(p^5)O(p5). For sub-Gaussian noise, we show that our algorithm has a sample complexity of O(d8ε2log⁡(pδ))\mathcal{O}(\frac{d^8}{\varepsilon^2} \log (\frac{p}{\sqrt{\delta}}))O(ε2d8​log(δ​p​)) to achieve ε\varepsilonε element-wise additive error with respect to the true autoregression matrix with probability at most 1−δ1 - \delta1−δ, while for noise with bounded (4m)(4m)(4m)-th moment, with mmm being a positive integer, our algorithm has a sample complexity of O(d8ε2(p2δ)1/m)\mathcal{O}(\frac{d^8}{\varepsilon^2} (\frac{p^2}{\delta})^{1/m})O(ε2d8​(δp2​)1/m).

View on arXiv
Comments on this paper