Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2305.10015
Cited By
v1
v2
v3
v4 (latest)
Utility Theory of Synthetic Data Generation
17 May 2023
Shi Xu
W. Sun
Guang Cheng
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Utility Theory of Synthetic Data Generation"
41 / 41 papers shown
Enhancing Obsolescence Forecasting with Deep Generative Data Augmentation: A Semi-Supervised Framework for Low-Data Industrial Applications
Elie Saad
Mariem Besbes
Marc Zolghadri
Victor Czmil
Claude Baron
Vincent Bourgeois
262
0
0
02 May 2025
GReaTER: Generate Realistic Tabular data after data Enhancement and Reduction
Tung Sum Thomas Kwok
Chi-Hua Wang
Guang Cheng
LMTD
304
3
0
19 Mar 2025
A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops
International Conference on Learning Representations (ICLR), 2025
Shi Fu
Yingjie Wang
Yuzhu Chen
Xinmei Tian
Dacheng Tao
372
8
0
26 Feb 2025
On the optimal approximation of Sobolev and Besov functions using deep ReLU neural networks
Applied and Computational Harmonic Analysis (ACHA), 2024
Yunfei Yang
395
4
0
02 Sep 2024
A Survey on Statistical Theory of Deep Learning: Approximation, Training Dynamics, and Generative Models
Annual Review of Statistics and Its Application (ARSIA), 2024
Namjoon Suh
Guang Cheng
MedIm
349
18
0
14 Jan 2024
Downstream Task-Oriented Generative Model Selections on Synthetic Data Training for Fraud Detection Models
Yinan Cheng
ChiHua Wang
Vamsi K. Potluru
T. Balch
Guang Cheng
177
9
0
01 Jan 2024
Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic Segmentation
Neural Information Processing Systems (NeurIPS), 2023
Quang H. Nguyen
T. Vu
Anh Tran
Kim Dan Nguyen
DiffM
483
137
0
25 Sep 2023
Realistic Synthetic Financial Transactions for Anti-Money Laundering Models
Neural Information Processing Systems (NeurIPS), 2023
Erik Altman
Jovan Blanuvsa
Luc von Niederhäusern
Béni Egressy
Andreea Anghel
Kubilay Atasu
341
76
0
22 Jun 2023
Synthetic data, real errors: how (not) to publish and use synthetic data
International Conference on Machine Learning (ICML), 2023
B. V. Breugel
Zhaozhi Qian
M. Schaar
SyDa
279
42
0
16 May 2023
Synthetic Data from Diffusion Models Improves ImageNet Classification
Shekoofeh Azizi
Simon Kornblith
Chitwan Saharia
Mohammad Norouzi
David J. Fleet
VLM
DiffM
809
389
0
17 Apr 2023
Statistical Theory of Differentially Private Marginal-based Data Synthesis Algorithms
International Conference on Learning Representations (ICLR), 2023
Ximing Li
Chendi Wang
Guang Cheng
SyDa
325
10
0
21 Jan 2023
Optimal Approximation Rates for Deep ReLU Neural Networks on Sobolev and Besov Spaces
Journal of machine learning research (JMLR), 2022
Jonathan W. Siegel
572
43
0
25 Nov 2022
Improving Adversarial Robustness by Contrastive Guided Diffusion Process
International Conference on Machine Learning (ICML), 2022
Yidong Ouyang
Liyan Xie
Guang Cheng
202
10
0
18 Oct 2022
Downstream Datasets Make Surprisingly Good Pretraining Corpora
Annual Meeting of the Association for Computational Linguistics (ACL), 2022
Kundan Krishna
Saurabh Garg
Jeffrey P. Bigham
Zachary Chase Lipton
213
37
0
28 Sep 2022
Synthetic Data -- what, why and how?
James Jordon
Lukasz Szpruch
F. Houssiau
M. Bottarelli
Giovanni Cherubin
Carsten Maple
Samuel N. Cohen
Adrian Weller
253
164
0
06 May 2022
Optimally tackling covariate shift in RKHS-based nonparametric regression
Annals of Statistics (Ann. Stat.), 2022
Cong Ma
Reese Pathak
Martin J. Wainwright
169
56
0
06 May 2022
Generative Adversarial Networks
International Conference on Computing Communication and Networking Technologies (ICCCNT), 2021
Gilad Cohen
Raja Giryes
GAN
816
30,409
0
01 Mar 2022
Distribution-Invariant Differential Privacy
Xuan Bi
Xiaotong Shen
232
20
0
08 Nov 2021
A Deep Generative Approach to Conditional Sampling
Xingyu Zhou
Yuling Jiao
Jin Liu
Jian Huang
150
51
0
19 Oct 2021
Improving Robustness using Generated Data
Sven Gowal
Sylvestre-Alvise Rebuffi
Olivia Wiles
Florian Stimberg
D. A. Calian
Timothy A. Mann
374
351
0
18 Oct 2021
Synthetic Data Generation for Fraud Detection using GANs
C. Charitou
S. Dragicevic
Artur Garcez
127
24
0
26 Sep 2021
Relaxed Marginal Consistency for Differentially Private Query Answering
Ryan McKenna
Siddhant Pradhan
Daniel Sheldon
G. Miklau
274
10
0
13 Sep 2021
Synthetic Data for Model Selection
International Conference on Machine Learning (ICML), 2021
Alon Shoshan
Nadav Bhonker
Igor Kviatkovsky
Matan Fintz
Gérard Medioni
152
7
0
03 May 2021
Maximum Likelihood Training of Score-Based Diffusion Models
Neural Information Processing Systems (NeurIPS), 2021
Yang Song
Conor Durkan
Iain Murray
Stefano Ermon
DiffM
764
799
0
22 Jan 2021
Non-Negative Bregman Divergence Minimization for Deep Direct Density Ratio Estimation
International Conference on Machine Learning (ICML), 2020
Masahiro Kato
Takeshi Teshima
359
49
0
12 Jun 2020
Decision-Making with Auto-Encoding Variational Bayes
Neural Information Processing Systems (NeurIPS), 2020
Romain Lopez
Pierre Boyeau
Nir Yosef
Michael I. Jordan
Jeffrey Regier
BDL
1.5K
20,656
0
17 Feb 2020
Modeling Tabular data using Conditional GAN
Neural Information Processing Systems (NeurIPS), 2019
Lei Xu
Maria Skoularidou
Alfredo Cuesta-Infante
K. Veeramachaneni
CML
MU
SyDa
GAN
489
1,629
0
01 Jul 2019
Unlabeled Data Improves Adversarial Robustness
Neural Information Processing Systems (NeurIPS), 2019
Y. Carmon
Aditi Raghunathan
Ludwig Schmidt
Abigail Z. Jacobs
John C. Duchi
490
791
0
31 May 2019
SynC: A Unified Framework for Generating Synthetic Population with Gaussian Copula
Colin Wan
Zheng Li
Alicia Guo
Yue Zhao
SyDa
126
8
0
16 Apr 2019
How Well Generative Adversarial Networks Learn Distributions
Journal of machine learning research (JMLR), 2018
Tengyuan Liang
GAN
322
110
0
07 Nov 2018
Measuring the quality of Synthetic data for use in competitions
James Jordon
Chang Jo Kim
M. Schaar
105
33
0
29 Jun 2018
Training Deep Networks with Synthetic Data: Bridging the Reality Gap by Domain Randomization
Jonathan Tremblay
Aayush Prakash
David Acuna
M. Brophy
Varun Jampani
Cem Anil
Thang To
Eric Cameracci
Shaad Boochoon
Stan Birchfield
OOD
415
924
0
18 Apr 2018
Differentially Private Generative Adversarial Network
Liyang Xie
Kaixiang Lin
Shu Wang
Fei Wang
Jiayu Zhou
SyDa
227
556
0
19 Feb 2018
Synthetic Data Augmentation using GAN for Improved Liver Lesion Classification
Maayan Frid-Adar
Eyal Klang
Michal Amitai
Jacob Goldberger
H. Greenspan
MedIm
GAN
219
800
0
08 Jan 2018
Nonparametric regression using deep neural networks with ReLU activation function
Johannes Schmidt-Hieber
634
926
0
22 Aug 2017
Least Squares Generative Adversarial Networks
Xudong Mao
Qing Li
Haoran Xie
Raymond Y. K. Lau
Zhen Wang
Stephen Paul Smolley
GAN
846
4,880
0
13 Nov 2016
The Normal Law Under Linear Restrictions: Simulation and Estimation via Minimax Tilting
Z. Botev
132
259
0
14 Mar 2016
Conditional Generative Adversarial Nets
M. Berk Mirza
Simon Osindero
GAN
SyDa
AI4CE
965
11,266
0
06 Nov 2014
What Regularized Auto-Encoders Learn from the Data Generating Distribution
Journal of machine learning research (JMLR), 2012
Guillaume Alain
Yoshua Bengio
OOD
DRL
282
530
0
18 Nov 2012
Choice of neighbor order in nearest-neighbor classification
P. Hall
B. Park
R. Samworth
426
329
0
29 Oct 2008
Fast learning rates for plug-in classifiers
Jean-Yves Audibert
Alexandre B. Tsybakov
975
493
0
17 Aug 2007
1