Continuous Diffusion for Mixed-Type Tabular Data

16 December 2023
Markus Mueller
Kathrin Gruber
Dennis Fok
Abstract

Score-based generative models, commonly referred to as diffusion models, have proven to be successful at generating text and image data. However, their adaptation to mixed-type tabular data remains underexplored. In this work, we propose CDTD, a Continuous Diffusion model for mixed-type Tabular Data. CDTD is based on a novel combination of score matching and score interpolation to enforce a unified continuous noise distribution for both continuous and categorical features. We explicitly acknowledge the necessity of homogenizing distinct data types by relying on model-specific loss calibration and initialization schemes. To further address the high heterogeneity in mixed-type tabular data, we introduce adaptive feature- or type-specific noise schedules. These ensure balanced generative performance across features and optimize the allocation of model capacity across features and diffusion time. Our experimental results show that CDTD consistently outperforms state-of-the-art benchmark models, captures feature correlations exceptionally well, and that heterogeneity in the noise schedule design boosts sample quality. Replication code is available at this https URL.
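To make the idea of a unified continuous noise distribution more concrete, the following is a minimal toy sketch, not the authors' implementation. It assumes categorical values are mapped to embedding vectors, that the same Gaussian noise level `sigma` is applied to continuous features and to those embeddings, and that the score for a categorical feature is obtained by interpolation, i.e. from the probability-weighted average of the class embeddings. All names (`add_noise`, `class_posterior`, `interpolated_score`), the embedding size, and the data are hypothetical; the exact class posterior under a uniform prior stands in for a trained classifier network.

```python
import numpy as np

def add_noise(x, sigma, rng):
    """Perturb x with isotropic Gaussian noise of standard deviation sigma."""
    return x + sigma * rng.standard_normal(x.shape)

def class_posterior(x_t, embeddings, sigma):
    """Exact p(class | x_t) under a uniform class prior.

    Assumption: this closed form replaces the trained classifier
    that a real model would use.
    """
    sq_dist = ((x_t[:, None, :] - embeddings[None, :, :]) ** 2).sum(axis=-1)
    logits = -sq_dist / (2.0 * sigma ** 2)
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    probs = np.exp(logits)
    return probs / probs.sum(axis=1, keepdims=True)

def interpolated_score(x_t, embeddings, sigma):
    """Score estimate from the probability-weighted average of embeddings."""
    probs = class_posterior(x_t, embeddings, sigma)
    x0_hat = probs @ embeddings            # expected clean embedding
    return (x0_hat - x_t) / sigma ** 2     # Gaussian score: (E[x0 | x_t] - x_t) / sigma^2

rng = np.random.default_rng(0)
embeddings = rng.standard_normal((3, 4))   # 3 categories, 4-dim embeddings (hypothetical)
labels = np.array([0, 2, 1, 1])            # one categorical feature, 4 rows
x_cont = rng.standard_normal((4, 2))       # 2 continuous features, 4 rows
sigma = 0.5                                # shared noise level for both feature types

x_cat_t = add_noise(embeddings[labels], sigma, rng)   # noisy categorical embeddings
x_cont_t = add_noise(x_cont, sigma, rng)              # noisy continuous features
score_cat = interpolated_score(x_cat_t, embeddings, sigma)
```

In this sketch both feature types live in Euclidean space and receive the same kind of noise, which is the property the abstract describes. Loss calibration, initialization, and the adaptive feature- or type-specific noise schedules are not modeled here.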

@article{mueller2025_2312.10431,
  title={Continuous Diffusion for Mixed-Type Tabular Data},
  author={Markus Mueller and Kathrin Gruber and Dennis Fok},
  journal={arXiv preprint arXiv:2312.10431},
  year={2025}
}