ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1508.05476
61
32
v1v2 (latest)

Regression modeling on stratified data: automatic and covariate-specific selection of the reference stratum with simple L1L_1L1​-norm penalties

22 August 2015
E. Ollier
V. Viallon
ArXiv (abs)PDFHTML
Abstract

We consider regression models to be estimated on K>1K>1K>1 pre-defined strata of a sample. Denote by βk,j∗\beta^*_{k,j}βk,j∗​ the theoretical parameter associated to the jjj-th covariate in the kkk-th stratum. It is common practice to first arbitrarily chose a reference stratum, say ℓ\ellℓ, and perform inference based on the decomposition βk,j∗=βℓ,j∗+γk,j∗\beta^*_{k,j} = \beta^*_{\ell,j} + \gamma^*_{k,j}βk,j∗​=βℓ,j∗​+γk,j∗​. In particular, ℓ1\ell_1ℓ1​-penalized regression models can be constructed to recover non-zero parameters among the βℓ,j∗\beta^*_{\ell,j}βℓ,j∗​'s and the γk,j∗ \gamma^*_{k,j}γk,j∗​'s. In this paper, we present a simple though efficient method that bypasses the arbitrary choice of the reference stratum at no cost. Its implementation can be done with available packages under a variety of models and, in the linear regression model, we show it is sparsistent under conditions similar to those ensuring sparsistency for an oracular version of the reference stratum strategy. Our empirical study further shows that our proposal performs at least as well as its competitors under the considered settings. As a final illustration, an analysis of road safety data is provided.

View on arXiv
Comments on this paper