ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1912.04738
16
2

Histogram Transform Ensembles for Large-scale Regression

8 December 2019
H. Hang
Zhouchen Lin
Xiaoyu Liu
Hongwei Wen
ArXiv (abs)PDFHTML
Abstract

We propose a novel algorithm for large-scale regression problems named histogram transform ensembles (HTE), composed of random rotations, stretchings, and translations. First of all, we investigate the theoretical properties of HTE when the regression function lies in the H\"{o}lder space Ck,αC^{k,\alpha}Ck,α, k∈N0k \in \mathbb{N}_0k∈N0​, α∈(0,1]\alpha \in (0,1]α∈(0,1]. In the case that k=0,1k=0, 1k=0,1, we adopt the constant regressors and develop the na\"{i}ve histogram transforms (NHT). Within the space C0,αC^{0,\alpha}C0,α, although almost optimal convergence rates can be derived for both single and ensemble NHT, we fail to show the benefits of ensembles over single estimators theoretically. In contrast, in the subspace C1,αC^{1,\alpha}C1,α, we prove that if d≥2(1+α)/αd \geq 2(1+\alpha)/\alphad≥2(1+α)/α, the lower bound of the convergence rates for single NHT turns out to be worse than the upper bound of the convergence rates for ensemble NHT. In the other case when k≥2k \geq 2k≥2, the NHT may no longer be appropriate in predicting smoother regression functions. Instead, we apply kernel histogram transforms (KHT) equipped with smoother regressors such as support vector machines (SVMs), and it turns out that both single and ensemble KHT enjoy almost optimal convergence rates. Then we validate the above theoretical results by numerical experiments. On the one hand, simulations are conducted to elucidate that ensemble NHT outperform single NHT. On the other hand, the effects of bin sizes on accuracy of both NHT and KHT also accord with theoretical analysis. Last but not least, in the real-data experiments, comparisons between the ensemble KHT, equipped with adaptive histogram transforms, and other state-of-the-art large-scale regression estimators verify the effectiveness and accuracy of our algorithm.

View on arXiv
Comments on this paper