Scalable Out-of-distribution Robustness in the Presence of Unobserved Confounders
We consider the task of out-of-distribution (OOD) generalization, where the distribution shift is due to an unobserved confounder () affecting both the covariates () and the labels (). This confounding introduces heterogeneity in the predictor, i.e., , making traditional covariate and label shift assumptions unsuitable. OOD generalization differs from traditional domain adaptation in that it does not assume access to the covariate distribution () of the test samples during training. These conditions create a challenging scenario for OOD robustness: (a) is an unobserved confounder during training, (b) , (c) is unavailable during training, and (d) the predictive distribution depends on . While prior work has developed complex predictors requiring multiple additional variables for identifiability of the latent distribution, we explore a set of identifiability assumptions that yield a surprisingly simple predictor using only a single additional variable. Our approach demonstrates superior empirical performance on several benchmark tasks.
View on arXiv