Connecting model-based and model-free approaches to linear least squares
regression
In a regression setting with response vector and given regressor vectors , a typical question is to what extent is related to these regressor vectors, specifically, how well can be approximated by a linear combination of them. Classical methods for this question are based on statistical models for the conditional distribution of , given the regressor vectors . Davies and Duembgen (2020) proposed a model-free approach in which all observation vectors and are viewed as fixed, and the quality of the least squares fit of is quantified by comparing it with the least squares fit resulting from independent white noise regressor vectors. The purpose of the present note is to explain in a general context why the model-based and model-free approach yield the same p-values, although the interpretation of the latter is different under the two paradigms.
View on arXiv