v1v2v3v4 (latest)

Utility Theory of Synthetic Data Generation

17 May 2023

Papers citing "Utility Theory of Synthetic Data Generation"

41 / 41 papers shown

Enhancing Obsolescence Forecasting with Deep Generative Data Augmentation: A Semi-Supervised Framework for Low-Data Industrial Applications

262

02 May 2025

GReaTER: Generate Realistic Tabular data after data Enhancement and Reduction

309

19 Mar 2025

A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training LoopsInternational Conference on Learning Representations (ICLR), 2025

376

26 Feb 2025

On the optimal approximation of Sobolev and Besov functions using deep ReLU neural networksApplied and Computational Harmonic Analysis (ACHA), 2024

Yunfei Yang

396

02 Sep 2024

A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative ModelsAnnual Review of Statistics and Its Application (ARSIA), 2024

Namjoon Suh

Guang Cheng

MedIm

355

14 Jan 2024

Downstream Task-Oriented Generative Model Selections on Synthetic Data Training for Fraud Detection Models

184

01 Jan 2024

Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic SegmentationNeural Information Processing Systems (NeurIPS), 2023

483

137

25 Sep 2023

Realistic Synthetic Financial Transactions for Anti-Money Laundering ModelsNeural Information Processing Systems (NeurIPS), 2023

Erik Altman

Jovan Blanuvsa

Luc von Niederhäusern

Béni Egressy

Andreea Anghel

Kubilay Atasu

350

22 Jun 2023

Synthetic data, real errors: how (not) to publish and use synthetic dataInternational Conference on Machine Learning (ICML), 2023

280

16 May 2023

Synthetic Data from Diffusion Models Improves ImageNet Classification

David J. Fleet

844

390

17 Apr 2023

Statistical Theory of Differentially Private Marginal-based Data Synthesis AlgorithmsInternational Conference on Learning Representations (ICLR), 2023

334

21 Jan 2023

Optimal Approximation Rates for Deep ReLU Neural Networks on Sobolev and Besov SpacesJournal of machine learning research (JMLR), 2022

Jonathan W. Siegel

582

25 Nov 2022

Improving Adversarial Robustness by Contrastive Guided Diffusion ProcessInternational Conference on Machine Learning (ICML), 2022

Yidong Ouyang

Liyan Xie

Guang Cheng

208

18 Oct 2022

Downstream Datasets Make Surprisingly Good Pretraining CorporaAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

221

28 Sep 2022

Synthetic Data -- what, why and how?

256

166

06 May 2022

Optimally tackling covariate shift in RKHS-based nonparametric regressionAnnals of Statistics (Ann. Stat.), 2022

Cong Ma

Reese Pathak

Martin J. Wainwright

171

06 May 2022

Generative Adversarial NetworksInternational Conference on Computing Communication and Networking Technologies (ICCCNT), 2021

Gilad Cohen

Raja Giryes

GAN

816

30,401

01 Mar 2022

Distribution-Invariant Differential Privacy

Xuan Bi

Xiaotong Shen

241

08 Nov 2021

A Deep Generative Approach to Conditional Sampling

Xingyu Zhou

Yuling Jiao

Jin Liu

Jian Huang

157

19 Oct 2021

Improving Robustness using Generated Data

Sven Gowal

Sylvestre-Alvise Rebuffi

388

351

18 Oct 2021

Synthetic Data Generation for Fraud Detection using GANs

C. Charitou

S. Dragicevic

Artur Garcez

131

26 Sep 2021

Relaxed Marginal Consistency for Differentially Private Query Answering

280

13 Sep 2021

Synthetic Data for Model SelectionInternational Conference on Machine Learning (ICML), 2021

Igor Kviatkovsky

152

03 May 2021

Maximum Likelihood Training of Score-Based Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2021

788

804

22 Jan 2021

Non-Negative Bregman Divergence Minimization for Deep Direct Density Ratio EstimationInternational Conference on Machine Learning (ICML), 2020

Masahiro Kato

Takeshi Teshima

368

12 Jun 2020

Decision-Making with Auto-Encoding Variational BayesNeural Information Processing Systems (NeurIPS), 2020

Romain Lopez

Pierre Boyeau

Nir Yosef

Michael I. Jordan

Jeffrey Regier

BDL

1.5K

20,656

17 Feb 2020

Modeling Tabular data using Conditional GANNeural Information Processing Systems (NeurIPS), 2019

Lei Xu

Maria Skoularidou

Alfredo Cuesta-Infante

K. Veeramachaneni

CML MU SyDa GAN

497

1,639

01 Jul 2019

Unlabeled Data Improves Adversarial RobustnessNeural Information Processing Systems (NeurIPS), 2019

493

793

31 May 2019

SynC: A Unified Framework for Generating Synthetic Population with Gaussian Copula

133

16 Apr 2019

How Well Generative Adversarial Networks Learn DistributionsJournal of machine learning research (JMLR), 2018

Tengyuan Liang

GAN

325

110

07 Nov 2018

Measuring the quality of Synthetic data for use in competitions

James Jordon

Chang Jo Kim

M. Schaar

105

29 Jun 2018

Training Deep Networks with Synthetic Data: Bridging the Reality Gap by Domain Randomization

442

926

18 Apr 2018

Differentially Private Generative Adversarial Network

Fei Wang

227

557

19 Feb 2018

Synthetic Data Augmentation using GAN for Improved Liver Lesion Classification

Michal Amitai

222

800

08 Jan 2018

Nonparametric regression using deep neural networks with ReLU activation function

Johannes Schmidt-Hieber

639

928

22 Aug 2017

Least Squares Generative Adversarial Networks

Xudong Mao

Haoran Xie

882

4,897

13 Nov 2016

The Normal Law Under Linear Restrictions: Simulation and Estimation via Minimax Tilting

Z. Botev

132

260

14 Mar 2016

Conditional Generative Adversarial Nets

M. Berk Mirza

Simon Osindero

GAN SyDa AI4CE

967

11,281

06 Nov 2014

What Regularized Auto-Encoders Learn from the Data Generating DistributionJournal of machine learning research (JMLR), 2012

Guillaume Alain

Yoshua Bengio

OOD DRL

292

530

18 Nov 2012

Choice of neighbor order in nearest-neighbor classification

P. Hall

B. Park

R. Samworth

426

329

29 Oct 2008

Fast learning rates for plug-in classifiers

Jean-Yves Audibert

Alexandre B. Tsybakov

985

494

17 Aug 2007