ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1108.1961
95
14

Adaptive Minimax Estimation over Sparse ℓq\ell_qℓq​-Hulls

9 August 2011
Zhan Wang
S. Paterlini
Frank Gao
Yuhong Yang
ArXivPDFHTML
Abstract

Given a dictionary of MnM_nMn​ initial estimates of the unknown true regression function, we aim to construct linearly aggregated estimators that target the best performance among all the linear combinations under a sparse qqq-norm (0≤q≤10 \leq q \leq 10≤q≤1) constraint on the linear coefficients. Besides identifying the optimal rates of aggregation for these ℓq\ell_qℓq​-aggregation problems, our multi-directional (or universal) aggregation strategies by model mixing or model selection achieve the optimal rates simultaneously over the full range of 0≤q≤10\leq q \leq 10≤q≤1 for general MnM_nMn​ and upper bound tnt_ntn​ of the qqq-norm. Both random and fixed designs, with known or unknown error variance, are handled, and the ℓq\ell_qℓq​-aggregations examined in this work cover major types of aggregation problems previously studied in the literature. Consequences on minimax-rate adaptive regression under ℓq\ell_qℓq​-constrained true coefficients (0≤q≤10 \leq q \leq 10≤q≤1) are also provided. Our results show that the minimax rate of ℓq\ell_qℓq​-aggregation (0≤q≤10 \leq q \leq 10≤q≤1) is basically determined by an effective model size, which is a sparsity index that depends on qqq, tnt_ntn​, MnM_nMn​, and the sample size nnn in an easily interpretable way based on a classical model selection theory that deals with a large number of models. In addition, in the fixed design case, the model selection approach is seen to yield optimal rates of convergence not only in expectation but also with exponential decay of deviation probability. In contrast, the model mixing approach can have leading constant one in front of the target risk in the oracle inequality while not offering optimality in deviation probability.

View on arXiv
Comments on this paper