ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.00943
30
4
v1v2v3v4v5v6v7 (latest)

Data-generating models under which the random forest algorithm performs badly

2 October 2019
J. A. Ferreira
ArXiv (abs)PDFHTML
Abstract

Examples are given of data-generating models under which some versions of the random forest algorithm may fail to be consistent or at least may be extremely slow to converge to the optimal predictor. Evidence provided for these properties is based on partly intuitive and partly rigorous arguments and on numerical experiments. Although one can always choose a model under which random forests perform very badly, in each case simple methods based on statistics of `variable use' and `variable importance' can be used to construct a better predictor based on a sort of mixture of random forests.

View on arXiv
Comments on this paper