arXiv:1806.06179

Semi-supervised Inference for Explained Variance in High-dimensional Linear Regression and Its Applications

16 June 2018
T. Tony Cai
Zijian Guo
Abstract

We consider statistical inference for the explained variance $\beta^{\intercal}\Sigma\beta$ under the high-dimensional linear model $Y = X\beta + \epsilon$ in the semi-supervised setting, where $\beta$ is the regression vector and $\Sigma$ is the design covariance matrix. A calibrated estimator, which efficiently integrates both labelled and unlabelled data, is proposed. It is shown that the estimator achieves the minimax optimal rate of convergence in the general semi-supervised framework. The optimality result characterizes how the unlabelled data affects the minimax optimal rate. Moreover, the limiting distribution for the proposed estimator is established and data-driven confidence intervals for the explained variance are constructed. We further develop a randomized calibration technique for statistical inference in the presence of weak signals and apply the obtained inference results to a range of important statistical problems, including signal detection and global testing, prediction accuracy evaluation, and confidence ball construction. The numerical performance of the proposed methodology is demonstrated in simulation studies and an analysis of estimating heritability for a yeast segregant data set with multiple traits.
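
To make the target quantity concrete, here is a minimal Python sketch of $\beta^{\intercal}\Sigma\beta$ and of a naive plug-in estimate that uses labelled data for $\beta$ and pools labelled and unlabelled covariates for $\Sigma$. It is not the calibrated estimator proposed in the paper. It assumes a low-dimensional toy setting (so ordinary least squares is well defined), centred covariates, and unit noise variance. The function names, dimensions, and simulated data are all hypothetical choices made for illustration.

```python
import numpy as np


def explained_variance(beta, sigma):
    """Return the quadratic form beta^T Sigma beta."""
    beta = np.asarray(beta, dtype=float)
    sigma = np.asarray(sigma, dtype=float)
    return float(beta @ sigma @ beta)


def plug_in_estimate(x_lab, y_lab, x_unlab):
    """Naive plug-in (illustrative only, not the paper's estimator).

    beta is fitted by least squares on the labelled sample; Sigma is the
    second-moment matrix of the pooled covariates, assuming they are centred.
    """
    beta_hat, *_ = np.linalg.lstsq(x_lab, y_lab, rcond=None)
    x_all = np.vstack([x_lab, x_unlab])
    sigma_hat = x_all.T @ x_all / x_all.shape[0]
    return explained_variance(beta_hat, sigma_hat)


# Hypothetical toy data: p = 5 covariates with AR(1)-style covariance.
rng = np.random.default_rng(0)
p, n_lab, n_unlab = 5, 500, 5000
idx = np.arange(p)
sigma = 0.5 ** np.abs(np.subtract.outer(idx, idx))
beta = np.array([1.0, 0.5, 0.0, 0.0, -0.5])

chol = np.linalg.cholesky(sigma)
x_lab = rng.standard_normal((n_lab, p)) @ chol.T      # labelled covariates
x_unlab = rng.standard_normal((n_unlab, p)) @ chol.T  # unlabelled covariates
y_lab = x_lab @ beta + rng.standard_normal(n_lab)     # Y = X beta + eps

truth = explained_variance(beta, sigma)
estimate = plug_in_estimate(x_lab, y_lab, x_unlab)
print(f"true: {truth:.3f}, plug-in: {estimate:.3f}")
```

In this toy setting the unlabelled covariates only sharpen the estimate of $\Sigma$. In the high-dimensional regime the abstract addresses, a plug-in of this kind is generally biased, which is the motivation for a calibrated estimator.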
