Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.14822
Cited By
Vector Quantized Diffusion Model for Text-to-Image Synthesis
29 November 2021
Shuyang Gu
Dong Chen
Jianmin Bao
Fang Wen
Bo Zhang
Dongdong Chen
Lu Yuan
B. Guo
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Vector Quantized Diffusion Model for Text-to-Image Synthesis"
50 / 563 papers shown
Title
Towards Dataset Copyright Evasion Attack against Personalized Text-to-Image Diffusion Models
Kuofeng Gao
Yufei Zhu
Yiming Li
Jiawang Bai
Yong-Liang Yang
Z. Li
Shu-Tao Xia
34
0
0
05 May 2025
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities
X. Zhang
Jintao Guo
Shanshan Zhao
Minghao Fu
Lunhao Duan
Guo-Hua Wang
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
DiffM
60
0
0
05 May 2025
Multi-Modal Language Models as Text-to-Image Model Evaluators
Jiahui Chen
Candace Ross
Reyhane Askari Hemmat
Koustuv Sinha
Melissa Hall
M. Drozdzal
Adriana Romero-Soriano
EGVM
60
0
0
01 May 2025
InstructAttribute: Fine-grained Object Attributes editing with Instruction
Xingxi Yin
Jingfeng Zhang
Zhi Li
Y. Li
Y. Zhang
DiffM
97
0
0
01 May 2025
Likelihood-Free Variational Autoencoders
Chen Xu
Qiang Wang
Lijun Sun
DiffM
DRL
78
0
0
24 Apr 2025
Manifold Induced Biases for Zero-shot and Few-shot Detection of Generated Images
Jonathan Brokman
Amit Giloni
Omer Hofman
Roman Vainshtein
Hisashi Kojima
Guy Gilboa
20
1
0
21 Apr 2025
MLEP: Multi-granularity Local Entropy Patterns for Universal AI-generated Image Detection
Lin Yuan
X. Li
Yan Zhang
J. Zhang
Hongbo Li
Xinbo Gao
27
0
0
18 Apr 2025
Anti-Aesthetics: Protecting Facial Privacy against Customized Text-to-Image Synthesis
Songping Wang
Yueming Lyu
Shiqi Liu
Ning Li
Tong Tong
Hao Sun
Caifeng Shan
PICV
65
0
0
16 Apr 2025
From Text to Time? Rethinking the Effectiveness of the Large Language Model for Time Series Forecasting
Xinyu Zhang
Shanshan Feng
Xutao Li
AI4TS
24
0
0
09 Apr 2025
Gaussian Mixture Flow Matching Models
Hansheng Chen
Kai Zhang
Hao Tan
Zexiang Xu
Fujun Luan
Leonidas J. Guibas
Gordon Wetzstein
Sai Bi
DiffM
61
0
0
07 Apr 2025
Video-Bench: Human-Aligned Video Generation Benchmark
Hui Han
Siyuan Li
Jiaqi Chen
Yiwen Yuan
Yuling Wu
...
Y. Li
J. Zhang
Chi Zhang
Li Li
Yongxin Ni
EGVM
VGen
65
0
0
07 Apr 2025
Moment Quantization for Video Temporal Grounding
Xiaolong Sun
Le Wang
Sanping Zhou
Liushuai Shi
Kun Xia
Mengnan Liu
Yabing Wang
Gang Hua
MQ
29
0
0
03 Apr 2025
Exploring the Collaborative Advantage of Low-level Information on Generalizable AI-Generated Image Detection
Ziyin Zhou
Ke Sun
Zhongxi Chen
Xianming Lin
Yunpeng Luo
Ke Yan
Shouhong Ding
Xiaoshuai Sun
29
0
0
01 Apr 2025
MixerMDM: Learnable Composition of Human Motion Diffusion Models
Pablo Ruiz-Ponce
Germán Barquero
Cristina Palmero
Sergio Escalera
José García Rodríguez
DiffM
55
0
0
01 Apr 2025
Style Quantization for Data-Efficient GAN Training
Jian Wang
Xin Lan
Jizhe Zhou
Yuxin Tian
Jiancheng Lv
29
0
0
31 Mar 2025
Training-Free Text-Guided Image Editing with Visual Autoregressive Model
Yufei Wang
Lanqing Guo
Z. Li
Jiaxing Huang
Pichao Wang
Bihan Wen
J. Wang
DiffM
58
1
0
31 Mar 2025
MuseFace: Text-driven Face Editing via Diffusion-based Mask Generation Approach
Xin Zhang
Siting Huang
Xiangyang Luo
Yifan Xie
Weijiang Yu
Heng Chang
Fei Ma
Fei Richard Yu
DiffM
33
0
0
31 Mar 2025
FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics
Yixuan Li
Yu Tian
Yipo Huang
Wei Lu
Shiqi Wang
Weisi Lin
Anderson de Rezende Rocha
54
0
0
31 Mar 2025
StrokeFusion: Vector Sketch Generation via Joint Stroke-UDF Encoding and Latent Sequence Diffusion
Jin Zhou
Yi Zhou
Pengfei Xu
Hui Huang
DiffM
54
0
0
31 Mar 2025
Evaluating Text-to-Image Synthesis with a Conditional Fréchet Distance
Jaywon Koo
J. Hernandez
Moayed Haji-Ali
Ziyan Yang
Vicente Ordonez
EGVM
67
0
0
27 Mar 2025
CO-SPY: Combining Semantic and Pixel Features to Detect Synthetic Images by AI
Siyuan Cheng
Lingjuan Lyu
Zhenting Wang
X. Zhang
Vikash Sehwag
40
0
0
24 Mar 2025
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models
Jinho Jeong
Sangmin Han
Jinwoo Kim
Seon Joo Kim
34
0
0
24 Mar 2025
LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images
Leyang Wang
Joice Lin
DiffM
63
0
0
20 Mar 2025
FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing
Tianyi Wei
Yifan Zhou
Dongdong Chen
Xingang Pan
72
0
0
20 Mar 2025
Tokenize Image as a Set
Zigang Geng
Mengde Xu
Han Hu
Shuyang Gu
DiffM
48
0
0
20 Mar 2025
TextInVision: Text and Prompt Complexity Driven Visual Text Generation Benchmark
Forouzan Fallah
Maitreya Patel
Agneet Chatterjee
Vlad I. Morariu
Chitta Baral
Yezhou Yang
CoGe
59
0
0
17 Mar 2025
InteractEdit: Zero-Shot Editing of Human-Object Interactions in Images
Jiun Tian Hoe
Weipeng Hu
Wei Zhou
Chao Xie
Ziwei Wang
Chee Seng Chan
Xudong Jiang
Y. Tan
61
0
0
12 Mar 2025
Generalizable AI-Generated Image Detection Based on Fractal Self-Similarity in the Spectrum
Shengpeng Xiao
Yuanfang Guo
Heqi Peng
Zeming Liu
Liang Yang
Y. Wang
57
0
0
11 Mar 2025
Few-Shot Class-Incremental Model Attribution Using Learnable Representation From CLIP-ViT Features
Hanbyul Lee
Juneho Yi
DiffM
46
0
0
11 Mar 2025
A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications
Siyuan Mu
Sen Lin
MoE
87
1
0
10 Mar 2025
Explainable Synthetic Image Detection through Diffusion Timestep Ensembling
Y. Wu
Feiran Zhang
Tianyuan Shi
Ruicheng Yin
Zhenghua Wang
Zhenliang Gan
X. Wang
Changze Lv
Xiaoqing Zheng
Xuanjing Huang
55
0
0
08 Mar 2025
Generalized Interpolating Discrete Diffusion
Dimitri von Rutte
J. Fluri
Yuhui Ding
Antonio Orvieto
Bernhard Scholkopf
Thomas Hofmann
DiffM
59
0
0
06 Mar 2025
Towards Improved Text-Aligned Codebook Learning: Multi-Hierarchical Codebook-Text Alignment with Long Text
Guotao Liang
Baoquan Zhang
Zhiyuan Wen
Junteng Zhao
Yunming Ye
Kola Ye
Yao He
43
0
0
03 Mar 2025
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Ziyang Zhang
Yang Yu
Yucheng Chen
Xulei Yang
S. Yeo
MedIm
48
1
0
02 Mar 2025
AesthetiQ: Enhancing Graphic Layout Design via Aesthetic-Aware Preference Alignment of Multi-modal Large Language Models
Sohan Patnaik
Rishabh Jain
Balaji Krishnamurthy
Mausoom Sarkar
26
0
0
01 Mar 2025
Bayesian Computation in Deep Learning
Wenlong Chen
Bolian Li
Ruqi Zhang
Yingzhen Li
BDL
70
0
0
25 Feb 2025
LaRE
2
^2
2
: Latent Reconstruction Error Based Method for Diffusion-Generated Image Detection
Yunpeng Luo
Junlong Du
Ke Yan
Shouhong Ding
DiffM
132
19
0
24 Feb 2025
FreeBlend: Advancing Concept Blending with Staged Feedback-Driven Interpolation Diffusion
Yufan Zhou
Haoyu Shen
Huan Wang
DiffM
100
0
0
17 Feb 2025
PDA: Generalizable Detection of AI-Generated Images via Post-hoc Distribution Alignment
Li Wang
Wenyu Chen
Zheng Li
Shanqing Guo
34
0
0
15 Feb 2025
UniMoD: Efficient Unified Multimodal Transformers with Mixture-of-Depths
Weijia Mao
Z. Yang
Mike Zheng Shou
MoE
63
0
0
10 Feb 2025
Improved Training Technique for Latent Consistency Models
Quan Dao
Khanh Doan
Di Liu
Trung Le
Dimitris N. Metaxas
60
3
0
03 Feb 2025
Categorical Schr\"odinger Bridge Matching
Grigoriy Ksenofontov
Alexander Korotin
56
0
0
03 Feb 2025
DreamOmni: Unified Image Generation and Editing
Bin Xia
Yuechen Zhang
Jingyao Li
Chengyao Wang
Yitong Wang
Xinglong Wu
Bei Yu
Jiaya Jia
SyDa
MLLM
79
3
0
22 Dec 2024
GCA-3D: Towards Generalized and Consistent Domain Adaptation of 3D Generators
Hengjia Li
Yang Liu
Yibo Zhao
Haoran Cheng
Yang Yang
...
Qibo Qiu
Boxi Wu
Tu Zheng
Zheng Yang
D. Cai
87
0
0
20 Dec 2024
LaMI-GO: Latent Mixture Integration for Goal-Oriented Communications Achieving High Spectrum Efficiency
Achintha Wijesinghe
Suchinthaka Wanninayaka
Weiwei Wang
Yu-Chieh Chao
Songyang Zhang
Zhi Ding
28
0
0
18 Dec 2024
Learning Implicit Features with Flow Infused Attention for Realistic Virtual Try-On
Delong Zhang
Qiwei Huang
Yuanliu Liu
Yang Sun
Wei-Shi Zheng
Pengfei Xiong
Wei Zhang
3DH
73
0
0
16 Dec 2024
Dual-Schedule Inversion: Training- and Tuning-Free Inversion for Real Image Editing
Jiancheng Huang
Yi Huang
Jianzhuang Liu
Donghao Zhou
Y. Liu
Shifeng Chen
DiffM
77
0
0
15 Dec 2024
Video Diffusion Transformers are In-Context Learners
Zhengcong Fei
Di Qiu
Changqian Yu
Debang Li
Mingyuan Fan
VGen
DiffM
130
2
0
14 Dec 2024
FIRE: Robust Detection of Diffusion-Generated Images via Frequency-Guided Reconstruction Error
Beilin Chu
Xuan Xu
Xin Wang
Y. Zhang
Weike You
Linna Zhou
DiffM
92
1
0
10 Dec 2024
[MASK] is All You Need
Vincent Tao Hu
Bjorn Ommer
DiffM
135
2
0
09 Dec 2024
1
2
3
4
...
10
11
12
Next