ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2008.04348
43
9

Design based incomplete U-statistics

10 August 2020
Xiangshun Kong
Wei Zheng
ArXiv (abs)PDFHTML
Abstract

U-statistics are widely used in fields such as economics, machine learning, and statistics. However, while they enjoy desirable statistical properties, they have an obvious drawback in that the computation becomes impractical as the data size nnn increases. Specifically, the number of combinations, say mmm, that a U-statistic of order ddd has to evaluate is O(nd)O(n^d)O(nd). Many efforts have been made to approximate the original U-statistic using a small subset of combinations since Blom (1976), who referred to such an approximation as an incomplete U-statistic. To the best of our knowledge, all existing methods require mmm to grow at least faster than nnn, albeit more slowly than ndn^dnd, in order for the corresponding incomplete U-statistic to be asymptotically efficient in terms of the mean squared error. In this paper, we introduce a new type of incomplete U-statistic that can be asymptotically efficient, even when mmm grows more slowly than nnn. In some cases, mmm is only required to grow faster than n\sqrt{n}n​. Our theoretical and empirical results both show significant improvements in the statistical efficiency of the new incomplete U-statistic.

View on arXiv
Comments on this paper