We consider regression models to be estimated on pre-defined strata of a sample. Denote by the theoretical parameter associated to the -th covariate in the -th stratum. It is common practice to first arbitrarily chose a reference stratum, say , and perform inference based on the decomposition . In particular, -penalized regression models can be constructed to recover non-zero parameters among the 's and the 's. In this paper, we present a simple though efficient method that bypasses the arbitrary choice of the reference stratum at no cost. Its implementation can be done with available packages under a variety of models and, in the linear regression model, we show it is sparsistent under conditions similar to those ensuring sparsistency for an oracular version of the reference stratum strategy. Our empirical study further shows that our proposal performs at least as well as its competitors under the considered settings. As a final illustration, an analysis of road safety data is provided.
View on arXiv