ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2504.02723
131
0
v1v2 (latest)

Computing High-dimensional Confidence Sets for Arbitrary Distributions

3 April 2025
Chao Gao
Liren Shan
Vaidehi Srinivas
Aravindan Vijayaraghavan
ArXiv (abs)PDFHTML
Main:35 Pages
7 Figures
Bibliography:5 Pages
Appendix:1 Pages
Abstract

We study the problem of learning a high-density region of an arbitrary distribution over Rd\mathbb{R}^dRd. Given a target coverage parameter δ\deltaδ, and sample access to an arbitrary distribution DDD, we want to output a confidence set S⊂RdS \subset \mathbb{R}^dS⊂Rd such that SSS achieves δ\deltaδ coverage of DDD, i.e., Py∼D[y∈S]≥δ\mathbb{P}_{y \sim D} \left[ y \in S \right] \ge \deltaPy∼D​[y∈S]≥δ, and the volume of SSS is as small as possible. This is a central problem in high-dimensional statistics with applications in finding confidence sets, uncertainty quantification, and support estimation.In the most general setting, this problem is statistically intractable, so we restrict our attention to competing with sets from a concept class CCC with bounded VC-dimension. An algorithm is competitive with class CCC if, given samples from an arbitrary distribution DDD, it outputs in polynomial time a set that achieves δ\deltaδ coverage of DDD, and whose volume is competitive with the smallest set in CCC with the required coverage δ\deltaδ. This problem is computationally challenging even in the basic setting when CCC is the set of all Euclidean balls. Existing algorithms based on coresets find in polynomial time a ball whose volume is exp⁡(O~(d/log⁡d))\exp(\tilde{O}( d/ \log d))exp(O~(d/logd))-factor competitive with the volume of the best ball.Our main result is an algorithm that finds a confidence set whose volume is exp⁡(O~(d1/2))\exp(\tilde{O}(d^{1/2}))exp(O~(d1/2)) factor competitive with the optimal ball having the desired coverage. The algorithm is improper (it outputs an ellipsoid). Combined with our computational intractability result for proper learning balls within an exp⁡(O~(d1−o(1)))\exp(\tilde{O}(d^{1-o(1)}))exp(O~(d1−o(1))) approximation factor in volume, our results provide an interesting separation between proper and (improper) learning of confidence sets.

View on arXiv
@article{gao2025_2504.02723,
  title={ Computing High-dimensional Confidence Sets for Arbitrary Distributions },
  author={ Chao Gao and Liren Shan and Vaidehi Srinivas and Aravindan Vijayaraghavan },
  journal={arXiv preprint arXiv:2504.02723},
  year={ 2025 }
}
Comments on this paper