ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2410.08207
26
4

DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models

10 October 2024
Xiaoxiao He
Ligong Han
Quan Dao
Song Wen
Minhao Bai
Di Liu
Han Zhang
Martin Renqiang Min
Felix Juefei Xu
Chaowei Tan
Bo Liu
Kang Li
Hongdong Li
Junzhou Huang
Faez Ahmed
Akash Srivastava
Dimitris Metaxas
    DiffM
    SyDa
ArXivPDFHTML
Abstract

Discrete diffusion models have achieved success in tasks like image generation and masked language modeling but face limitations in controlled content editing. We introduce DICE (Discrete Inversion for Controllable Editing), the first approach to enable precise inversion for discrete diffusion models, including multinomial diffusion and masked generative models. By recording noise sequences and masking patterns during the reverse diffusion process, DICE enables accurate reconstruction and flexible editing of discrete data without the need for predefined masks or attention manipulation. We demonstrate the effectiveness of DICE across both image and text domains, evaluating it on models such as VQ-Diffusion, Paella, and RoBERTa. Our results show that DICE preserves high data fidelity while enhancing editing capabilities, offering new opportunities for fine-grained content manipulation in discrete spaces.

View on arXiv
@article{he2025_2410.08207,
  title={ DICE: Discrete Inversion Enabling Controllable Editing for Multinomial Diffusion and Masked Generative Models },
  author={ Xiaoxiao He and Ligong Han and Quan Dao and Song Wen and Minhao Bai and Di Liu and Han Zhang and Martin Renqiang Min and Felix Juefei-Xu and Chaowei Tan and Bo Liu and Kang Li and Hongdong Li and Junzhou Huang and Faez Ahmed and Akash Srivastava and Dimitris Metaxas },
  journal={arXiv preprint arXiv:2410.08207},
  year={ 2025 }
}
Comments on this paper