v1v2v3v4 (latest)

Utility Theory of Synthetic Data Generation

17 May 2023

Papers citing "Utility Theory of Synthetic Data Generation"

41 / 41 papers shown

Enhancing Obsolescence Forecasting with Deep Generative Data Augmentation: A Semi-Supervised Framework for Low-Data Industrial Applications

262

02 May 2025

GReaTER: Generate Realistic Tabular data after data Enhancement and Reduction

304

19 Mar 2025

A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training LoopsInternational Conference on Learning Representations (ICLR), 2025

372

26 Feb 2025

On the optimal approximation of Sobolev and Besov functions using deep ReLU neural networksApplied and Computational Harmonic Analysis (ACHA), 2024

Yunfei Yang

395

02 Sep 2024

A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative ModelsAnnual Review of Statistics and Its Application (ARSIA), 2024

Namjoon Suh

Guang Cheng

MedIm

349

14 Jan 2024

Downstream Task-Oriented Generative Model Selections on Synthetic Data Training for Fraud Detection Models

177

01 Jan 2024

Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic SegmentationNeural Information Processing Systems (NeurIPS), 2023

483

137

25 Sep 2023

Realistic Synthetic Financial Transactions for Anti-Money Laundering ModelsNeural Information Processing Systems (NeurIPS), 2023

Erik Altman

Jovan Blanuvsa

Luc von Niederhäusern

Béni Egressy

Andreea Anghel

Kubilay Atasu

341

22 Jun 2023

Synthetic data, real errors: how (not) to publish and use synthetic dataInternational Conference on Machine Learning (ICML), 2023

279

16 May 2023

Synthetic Data from Diffusion Models Improves ImageNet Classification

David J. Fleet

809

389

17 Apr 2023

Statistical Theory of Differentially Private Marginal-based Data Synthesis AlgorithmsInternational Conference on Learning Representations (ICLR), 2023

325

21 Jan 2023

Optimal Approximation Rates for Deep ReLU Neural Networks on Sobolev and Besov SpacesJournal of machine learning research (JMLR), 2022

Jonathan W. Siegel

572

25 Nov 2022

Improving Adversarial Robustness by Contrastive Guided Diffusion ProcessInternational Conference on Machine Learning (ICML), 2022

Yidong Ouyang

Liyan Xie

Guang Cheng

202

18 Oct 2022

Downstream Datasets Make Surprisingly Good Pretraining CorporaAnnual Meeting of the Association for Computational Linguistics (ACL), 2022

213

28 Sep 2022

Synthetic Data -- what, why and how?

253

164

06 May 2022

Optimally tackling covariate shift in RKHS-based nonparametric regressionAnnals of Statistics (Ann. Stat.), 2022

Cong Ma

Reese Pathak

Martin J. Wainwright

169

06 May 2022

Generative Adversarial NetworksInternational Conference on Computing Communication and Networking Technologies (ICCCNT), 2021

Gilad Cohen

Raja Giryes

GAN

816

30,409

01 Mar 2022

Distribution-Invariant Differential Privacy

Xuan Bi

Xiaotong Shen

232

08 Nov 2021

A Deep Generative Approach to Conditional Sampling

Xingyu Zhou

Yuling Jiao

Jin Liu

Jian Huang

150

19 Oct 2021

Improving Robustness using Generated Data

Sven Gowal

Sylvestre-Alvise Rebuffi

374

351

18 Oct 2021

Synthetic Data Generation for Fraud Detection using GANs

C. Charitou

S. Dragicevic

Artur Garcez

127

26 Sep 2021

Relaxed Marginal Consistency for Differentially Private Query Answering

274

13 Sep 2021

Synthetic Data for Model SelectionInternational Conference on Machine Learning (ICML), 2021

Igor Kviatkovsky

152

03 May 2021

Maximum Likelihood Training of Score-Based Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2021

764

799

22 Jan 2021

Non-Negative Bregman Divergence Minimization for Deep Direct Density Ratio EstimationInternational Conference on Machine Learning (ICML), 2020

Masahiro Kato

Takeshi Teshima

359

12 Jun 2020

Decision-Making with Auto-Encoding Variational BayesNeural Information Processing Systems (NeurIPS), 2020

Romain Lopez

Pierre Boyeau

Nir Yosef

Michael I. Jordan

Jeffrey Regier

BDL

1.5K

20,656

17 Feb 2020

Modeling Tabular data using Conditional GANNeural Information Processing Systems (NeurIPS), 2019

Lei Xu

Maria Skoularidou

Alfredo Cuesta-Infante

K. Veeramachaneni

CML MU SyDa GAN

489

1,629

01 Jul 2019

Unlabeled Data Improves Adversarial RobustnessNeural Information Processing Systems (NeurIPS), 2019

490

791

31 May 2019

SynC: A Unified Framework for Generating Synthetic Population with Gaussian Copula

126

16 Apr 2019

How Well Generative Adversarial Networks Learn DistributionsJournal of machine learning research (JMLR), 2018

Tengyuan Liang

GAN

322

110

07 Nov 2018

Measuring the quality of Synthetic data for use in competitions

James Jordon

Chang Jo Kim

M. Schaar

105

29 Jun 2018

Training Deep Networks with Synthetic Data: Bridging the Reality Gap by Domain Randomization

415

924

18 Apr 2018

Differentially Private Generative Adversarial Network

Fei Wang

227

556

19 Feb 2018

Synthetic Data Augmentation using GAN for Improved Liver Lesion Classification

Michal Amitai

219

800

08 Jan 2018

Nonparametric regression using deep neural networks with ReLU activation function

Johannes Schmidt-Hieber

634

926

22 Aug 2017

Least Squares Generative Adversarial Networks

Xudong Mao

Haoran Xie

846

4,880

13 Nov 2016

The Normal Law Under Linear Restrictions: Simulation and Estimation via Minimax Tilting

Z. Botev

132

259

14 Mar 2016

Conditional Generative Adversarial Nets

M. Berk Mirza

Simon Osindero

GAN SyDa AI4CE

965

11,266

06 Nov 2014

What Regularized Auto-Encoders Learn from the Data Generating DistributionJournal of machine learning research (JMLR), 2012

Guillaume Alain

Yoshua Bengio

OOD DRL

282

530

18 Nov 2012

Choice of neighbor order in nearest-neighbor classification

P. Hall

B. Park

R. Samworth

426

329

29 Oct 2008

Fast learning rates for plug-in classifiers

Jean-Yves Audibert

Alexandre B. Tsybakov

975

493

17 Aug 2007