370
v1v2v3v4 (latest)

Adversarial random forests for density estimation and generative modeling

International Conference on Artificial Intelligence and Statistics (AISTATS), 2022
Abstract

We propose methods for density estimation and data synthesis using a novel form of unsupervised random forests. Inspired by generative adversarial networks, we implement a recursive procedure in which trees gradually learn structural properties of the data through alternating rounds of generation and discrimination. The method is provably consistent under minimal assumptions. Unlike classic tree-based alternatives, our approach provides smooth (un)conditional densities and allows for fully synthetic data generation. We achieve comparable or superior performance to state-of-the-art probabilistic circuits and deep learning models on various tabular data benchmarks while executing about two orders of magnitude faster on average. An accompanying R\texttt{R} package, arf\texttt{arf}, is available on CRAN\texttt{CRAN}.

View on arXiv
Comments on this paper