Degrees of freedom for combining regression with factor analysis

Natesh S. Pillai
Abstract

In multivariate regression problems with multiple responses, there often exist unobserved covariates that are correlated with the responses. These covariates can be estimated via factor analytic methods, but calculating unbiased error variance estimates after adjusting for latent factors requires assigning appropriate degrees of freedom to the estimated factors. Many ad-hoc solutions to this problem have been proposed without the support of a careful theoretical analysis. Using recent results from random matrix theory, we derive an expression for the degrees of freedom. Our estimate gives a principled alternative to the ad-hoc approaches in common use. Extensive simulation results show excellent agreement between the proposed estimator and its theoretical value. When we apply the methods to a microarray dataset with two estimated latent factors, our estimate assigns between 2.18 and 2.99 degrees of freedom, depending on which response is under consideration.
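
To make the setting concrete, the following is a minimal simulation sketch of the kind of ad-hoc adjustment the abstract refers to. It is not the estimator derived in the paper. It assumes one common naive convention (each estimated factor costs exactly one degree of freedom per response), estimates the factors from a truncated SVD of the regression residuals, and uses hypothetical names and sizes throughout (`naive_error_variance`, `n`, `p`, `q`, `k`, unit noise variance).

```python
import numpy as np

def naive_error_variance(Y, X, k):
    """Per-response error variance after removing X and k estimated factors.

    Assumption (not from the paper): each estimated factor is charged
    exactly one degree of freedom, so the divisor is n - q - k.
    """
    n, _ = Y.shape
    q = X.shape[1]
    # Ordinary least squares fit of every response on the observed covariates.
    B, *_ = np.linalg.lstsq(X, Y, rcond=None)
    R = Y - X @ B
    # Estimate k latent factors as the leading left singular vectors of the residuals.
    U, _, _ = np.linalg.svd(R, full_matrices=False)
    F = U[:, :k]
    # Project the estimated factors out of the residuals.
    E = R - F @ (F.T @ R)
    rss = np.sum(E ** 2, axis=0)
    return rss / (n - q - k)

# Hypothetical simulated data: n samples, p responses, q observed covariates,
# k latent factors, and true noise variance equal to 1.
rng = np.random.default_rng(0)
n, p, q, k = 50, 200, 2, 2
X = rng.normal(size=(n, q))
Z = rng.normal(size=(n, k))            # unobserved covariates
beta = rng.normal(size=(q, p))
loadings = 3.0 * rng.normal(size=(k, p))
Y = X @ beta + Z @ loadings + rng.normal(size=(n, p))

sigma2_hat = naive_error_variance(Y, X, k)
print("mean naive variance estimate:", sigma2_hat.mean())
```

Comparing the printed mean with the simulated noise variance of 1 gives a rough sense of how far the naive count can be off. Replacing the fixed `k` in the divisor with a per-response degrees-of-freedom value is the kind of correction the abstract describes.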