55
v1v2v3v4 (latest)

Connecting model-based and model-free approaches to linear least squares regression

Abstract

In a regression setting with response vector yRn\mathbf{y} \in \mathbb{R}^n and given regressors x1,,xpRn\mathbf{x}_1,\ldots,\mathbf{x}_p \in \mathbb{R}^n, a typical question is to what extent y\mathbf{y} is related to these regressors, specifically, how well can y\mathbf{y} be approximated by a linear combination of them. Classical methods for this question are based on statistical models for the conditional distribution of y\mathbf{y}, given the regressors xj\mathbf{x}_j. In the present paper it is shown that various p-values resulting from this model-based approach have also a purely data-analytic, model-free interpretation. This finding is derived in a rather general context. In addition, we introduce equivalence regions, a reinterpretation of confidence regions in the model-free context.

View on arXiv
Comments on this paper