2301.07573
Cited By
Synthcity: facilitating innovative use cases of synthetic data in different data modalities
18 January 2023
Zhaozhi Qian
B. Cebere
M. Schaar
SyDa
Papers citing "Synthcity: facilitating innovative use cases of synthetic data in different data modalities"
42 / 42 papers shown
Title
Generating Reliable Synthetic Clinical Trial Data: The Role of Hyperparameter Optimization and Domain Constraints
Waldemar Hahn
Jan-Niklas Eckardt
Christoph Röllig
Martin Sedlmayr
J. Middeke
M. Wolfien
36
0
0
08 May 2025
What's Wrong with Your Synthetic Tabular Data? Using Explainable AI to Evaluate Generative Models
Jan Kapar
Niklas Koenen
Martin Jullum
64
0
0
29 Apr 2025
Diffusion Transformers for Tabular Data Time Series Generation
Fabrizio Garuti
E. Sangineto
Simone Luetto
L. Forni
Rita Cucchiara
57
0
0
10 Apr 2025
Tabby: Tabular Data Synthesis with Language Models
Sonia Cromp
Satya Sai Srinath Namburi GNVV
Mohammed Alkhudhayri
Catherine Cao
Samuel Guo
Nicholas Roberts
Frederic Sala
LMTD
58
0
0
04 Mar 2025
A Generalized Theory of Mixup for Structure-Preserving Synthetic Data
Chungpa Lee
Jongho Im
Joseph H.T. Kim
33
0
0
03 Mar 2025
TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data
P. Tiwald
Ivona Krchova
Andrey Sidorenko
Mariana Vargas-Vieyra
Mario Scriminaci
Michael Platzer
44
1
0
21 Jan 2025
Creating Artificial Students that Never Existed: Leveraging Large Language Models and CTGANs for Synthetic Data Generation
Mohammad Khalil
Farhad Vadiee
Ronas Shakya
Qinyi Liu
SyDa
34
5
0
03 Jan 2025
Can Synthetic Data be Fair and Private? A Comparative Study of Synthetic Data Generation and Fairness Algorithms
Qinyi Liu
Oscar Blessed Deho
Farhad Vadiee
Mohammad Khalil
Srecko Joksimovic
George Siemens
SyDa
32
6
0
03 Jan 2025
Debiasing Synthetic Data Generated by Deep Generative Models
A. Decruyenaere
Heidelinde Dehaene
Paloma Rabaey
Christiaan Polet
Johan Decruyenaere
Thomas Demeester
S. Vansteelandt
AI4CE
23
0
0
06 Nov 2024
Generating Realistic Tabular Data with Large Language Models
Dang Nguyen
Sunil Gupta
Kien Do
Thin Nguyen
Svetha Venkatesh
LMTD
SyDa
30
1
0
29 Oct 2024
Benchmarking the Fidelity and Utility of Synthetic Relational Data
Valter Hudovernik
Martin Jurkovič
Erik Štrumbelj
23
2
0
04 Oct 2024
TabEBM: A Tabular Data Augmentation Method with Distinct Class-Specific Energy-Based Models
Andrei Margeloiu
Xiangjian Jiang
Nikola Simidjievski
M. Jamnik
20
5
0
24 Sep 2024
HARMONIC: Harnessing LLMs for Tabular Data Synthesis and Privacy Protection
Yuxin Wang
Duanyu Feng
Yongfu Dai
Zhengyu Chen
Jimin Huang
Sophia Ananiadou
Qianqian Xie
Hao Wang
24
3
0
06 Aug 2024
Synthetic Data: Revisiting the Privacy-Utility Trade-off
Fatima Jahan Sarmin
Atiquer Rahman Sarkar
Yang Wang
Noman Mohammed
20
3
0
09 Jul 2024
Automated Privacy-Preserving Techniques via Meta-Learning
Tânia Carvalho
Nuno Moniz
Luís Antunes
38
0
0
24 Jun 2024
Data Plagiarism Index: Characterizing the Privacy Risk of Data-Copying in Tabular Generative Models
Joshua Ward
Chi-Hua Wang
Guang Cheng
18
3
0
18 Jun 2024
Navigating Tabular Data Synthesis Research: Understanding User Needs and Tool Capabilities
Maria F. Davila
Sven Groen
Fabian Panse
Wolfram Wingerath
LMTD
27
1
0
31 May 2024
"What do you want from theory alone?" Experimenting with Tight Auditing of Differentially Private Synthetic Data Generation
Meenatchi Sundaram Muthu Selva Annamalai
Georgi Ganev
Emiliano De Cristofaro
22
9
0
16 May 2024
Permissioned Blockchain-based Framework for Ranking Synthetic Data Generators
Narasimha Raghavan
Mohammad Hossein Tabatabaei
Severin Elvatun
V. Vallevik
S. Larønningen
J. F. Nygård
27
2
0
12 May 2024
SynthEval: A Framework for Detailed Utility and Privacy Evaluation of Tabular Synthetic Data
A. Lautrup
Tobias Hyrup
Arthur Zimek
Peter Schneider-Kamp
21
0
0
24 Apr 2024
Systematic Assessment of Tabular Data Synthesis Algorithms
Yuntao Du
Ninghui Li
21
4
0
09 Feb 2024
A primer on synthetic health data
Jennifer Anne Bartell
Sander Boisen Valentin
Anders Krogh
Henning Langberg
Martin Bøgsted
19
1
0
31 Jan 2024
Curated LLM: Synergy of LLMs and Data Curation for tabular augmentation in low-data regimes
Nabeel Seedat
Nicolas Huynh
B. V. Breugel
M. Schaar
11
25
0
19 Dec 2023
Continuous Diffusion for Mixed-Type Tabular Data
Markus Mueller
Kathrin Gruber
Dennis Fok
DiffM
48
5
0
16 Dec 2023
SurvTimeSurvival: Survival Analysis On The Patient With Multiple Visits/Records
Le Hung
Eng-Jon Ong
Miroslaw Bober
22
1
0
16 Nov 2023
Reimagining Synthetic Tabular Data Generation through Data-Centric AI: A Comprehensive Benchmark
Lasse Hansen
Nabeel Seedat
M. Schaar
Andrija Petrović
30
17
0
25 Oct 2023
Tertiary Lymphoid Structures Generation through Graph-based Diffusion
Manuel Madeira
D. Thanou
Pascal Frossard
MedIm
25
1
0
10 Oct 2023
A Causal Perspective on Loan Pricing: Investigating the Impacts of Selection Bias on Identifying Bid-Response Functions
Christopher Bockel-Rickermann
Sam Verboven
Tim Verdonck
Wouter Verbeke
CML
13
0
0
07 Sep 2023
Deep Generative Models, Synthetic Tabular Data, and Differential Privacy: An Overview and Synthesis
Conor Hassan
Roberto Salomone
Kerrie Mengersen
11
6
0
28 Jul 2023
On the Usefulness of Synthetic Tabular Data Generation
Dionysis Manousakas
Sergul Aydore
8
8
0
27 Jun 2023
Diverse Community Data for Benchmarking Data Privacy Algorithms
Aniruddha Sen
Christine Task
Dhruv Kapur
Gary S. Howarth
Karan Bhagat
14
3
0
20 Jun 2023
Interpretable Deep Clustering for Tabular Data
Jonathan Svirsky
Ofir Lindenbaum
20
6
0
07 Jun 2023
Post-processing Private Synthetic Data for Improving Utility on Selected Measures
Hao Wang
Shivchander Sudalairaj
J. Henning
Kristjan Greenewald
Akash Srivastava
19
5
0
24 May 2023
TSGM: A Flexible Framework for Generative Modeling of Synthetic Time Series
Alexander Nikitin
Letizia Iannucci
Samuel Kaski
TTA
SyDa
AI4TS
10
10
0
19 May 2023
Synthetic data, real errors: how (not) to publish and use synthetic data
B. V. Breugel
Zhaozhi Qian
M. Schaar
SyDa
57
28
0
16 May 2023
Auditing and Generating Synthetic Data with Controllable Trust Trade-offs
Brian M. Belgodere
Pierre L. Dognin
Adam Ivankay
Igor Melnyk
Youssef Mroueh
...
Mattia Rigotti
Jerret Ross
Yair Schiff
Radhika Vedpathak
Richard A. Young
11
12
0
21 Apr 2023
Beyond Privacy: Navigating the Opportunities and Challenges of Synthetic Data
B. V. Breugel
M. Schaar
12
26
0
07 Apr 2023
Differentially-Private Data Synthetisation for Efficient Re-Identification Risk Control
Tânia Carvalho
Nuno Moniz
Luís Antunes
Nitesh V. Chawla
11
3
0
01 Dec 2022
DC-Check: A Data-Centric AI checklist to guide the development of reliable machine learning systems
Nabeel Seedat
F. Imrie
M. Schaar
12
12
0
09 Nov 2022
Customs Import Declaration Datasets
Chae-Seong Jeong
Sundong Kim
Jaewoo Park
Yeonsoo Choi
13
3
0
04 Aug 2022
Conditional Synthetic Data Generation for Robust Machine Learning Applications with Limited Pandemic Data
Hari Prasanna Das
Ryan Tran
Japjot Singh
Xiangyu Yue
G. Tison
Alberto L. Sangiovanni-Vincentelli
C. Spanos
OOD
MedIm
50
51
0
14 Sep 2021
A Survey on Bias and Fairness in Machine Learning
Ninareh Mehrabi
Fred Morstatter
N. Saxena
Kristina Lerman
Aram Galstyan
SyDa
FaML
286
4,143
0
23 Aug 2019