2111.14822
Vector Quantized Diffusion Model for Text-to-Image Synthesis
29 November 2021
Shuyang Gu
Dong Chen
Jianmin Bao
Fang Wen
Bo Zhang
Dongdong Chen
Lu Yuan
B. Guo
DiffM
Papers citing
"Vector Quantized Diffusion Model for Text-to-Image Synthesis"
50 / 563 papers shown
Title
Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models
Shweta Mahajan
Tanzila Rahman
Kwang Moo Yi
Leonid Sigal
DiffM
21
17
0
19 Dec 2023
Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models
Angela Castillo
Jonas Kohler
Juan C. Pérez
Juan Pablo Pérez
Albert Pumarola
Bernard Ghanem
Pablo Arbelaez
Ali K. Thabet
11
12
0
19 Dec 2023
IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition
Xiaomeng Yang
Zhi Qiao
Yu Zhou
DiffM
55
1
0
19 Dec 2023
FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection
Hongsuk Choi
Isaac Kasahara
Selim Engin
Moritz Graule
Nikhil Chavan-Dafle
Volkan Isler
DiffM
16
3
0
14 Dec 2023
Planning and Rendering: Towards End-to-End Product Poster Generation
Zhaochen Li
Fengheng Li
Wei Feng
Honghe Zhu
An Liu
...
Xin Zhu
Jun-Jun Shen
Zhangang Lin
Jingping Shao
Zhenglu Yang
DiffM
13
1
0
14 Dec 2023
Efficient and Scalable Graph Generation through Iterative Local Expansion
Andreas Bergmeister
Karolis Martinkus
Nathanael Perraudin
Roger Wattenhofer
21
12
0
14 Dec 2023
Black-box Membership Inference Attacks against Fine-tuned Diffusion Models
Yan Pang
Tianhao Wang
17
18
0
13 Dec 2023
Diffusion-based Blind Text Image Super-Resolution
Yuzhe Zhang
Jiawei Zhang
Hao Li
Zhouxia Wang
Luwei Hou
Dongqing Zou
Liheng Bian
27
8
0
13 Dec 2023
FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition
Sicheng Mo
Fangzhou Mu
Kuan Heng Lin
Yanli Liu
Bochen Guan
Yin Li
Bolei Zhou
DiffM
37
60
0
12 Dec 2023
GenDet: Towards Good Generalizations for AI-Generated Image Detection
Mingjian Zhu
Hanting Chen
Mouxiao Huang
Wei Li
Hailin Hu
Jie Hu
Yunhe Wang
21
16
0
12 Dec 2023
Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution
Shangchen Zhou
Peiqing Yang
Jianyi Wang
Yihang Luo
Chen Change Loy
VGen
99
37
0
11 Dec 2023
Separate-and-Enhance: Compositional Finetuning for Text2Image Diffusion Models
Zhipeng Bao
Yijun Li
Krishna Kumar Singh
Yu-Xiong Wang
Martial Hebert
20
8
0
10 Dec 2023
Efficient Quantization Strategies for Latent Diffusion Models
Yuewei Yang
Xiaoliang Dai
Jialiang Wang
Peizhao Zhang
Hongbo Zhang
DiffM
MQ
17
13
0
09 Dec 2023
RL Dreams: Policy Gradient Optimization for Score Distillation based 3D Generation
Aradhya Neeraj Mathur
Phu-Cuong Pham
Aniket Bera
Ojaswa Sharma
19
0
0
08 Dec 2023
Generating Illustrated Instructions
Sachit Menon
Ishan Misra
Rohit Girdhar
DiffM
24
4
0
07 Dec 2023
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
Zhiwu Qing
Shiwei Zhang
Jiayu Wang
Xiang Wang
Yujie Wei
Yingya Zhang
Changxin Gao
Nong Sang
VGen
DiffM
24
37
0
07 Dec 2023
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models
Jiayi Guo
Xingqian Xu
Yifan Pu
Zanlin Ni
Chaofei Wang
Manushree Vasu
Shiji Song
Gao Huang
Humphrey Shi
DiffM
22
28
0
07 Dec 2023
MEVG: Multi-event Video Generation with Text-to-Video Models
Gyeongrok Oh
Jaehwan Jeong
Sieun Kim
Wonmin Byeon
Jinkyu Kim
Sungwoong Kim
Sangpil Kim
VGen
DiffM
33
20
0
07 Dec 2023
Make-A-Storyboard: A General Framework for Storyboard with Disentangled and Merged Control
Sitong Su
Litao Guo
Lianli Gao
Hengtao Shen
Jingkuan Song
DiffM
33
3
0
06 Dec 2023
Diffusion Noise Feature: Accurate and Fast Generated Image Detection
Yichi Zhang
Xiaogang Xu
DiffM
15
12
0
05 Dec 2023
Fully Spiking Denoising Diffusion Implicit Models
Ryo Watanabe
Yusuke Mukuta
Tatsuya Harada
DiffM
18
3
0
04 Dec 2023
QPoser: Quantized Explicit Pose Prior Modeling for Controllable Pose Generation
Yumeng Li
Yaoxiang Ding
Zhong Ren
Kun Zhou
14
1
0
02 Dec 2023
VideoBooth: Diffusion-based Video Generation with Image Prompts
Yuming Jiang
Tianxing Wu
Shuai Yang
Chenyang Si
Dahua Lin
Yu Qiao
Chen Change Loy
Ziwei Liu
DiffM
VGen
32
65
0
01 Dec 2023
DanceMeld: Unraveling Dance Phrases with Hierarchical Latent Codes for Music-to-Dance Synthesis
Xin Gao
Liucheng Hu
Peng Zhang
Bang Zhang
Liefeng Bo
DiffM
24
4
0
30 Nov 2023
4D-fy: Text-to-4D Generation Using Hybrid Score Distillation Sampling
Sherwin Bahmani
Ivan Skorokhodov
Victor Rong
Gordon Wetzstein
Leonidas J. Guibas
Peter Wonka
Sergey Tulyakov
Jeong Joon Park
Andrea Tagliasacchi
David B. Lindell
DiffM
41
103
0
29 Nov 2023
VBench: Comprehensive Benchmark Suite for Video Generative Models
Ziqi Huang
Yinan He
Jiashuo Yu
Fan Zhang
Chenyang Si
...
Xinyuan Chen
Limin Wang
Dahua Lin
Yu Qiao
Ziwei Liu
VGen
62
346
0
29 Nov 2023
Vulnerability of Automatic Identity Recognition to Audio-Visual Deepfakes
Pavel Korshunov
Haolin Chen
Philip N. Garner
Sébastien Marcel
CVBM
29
4
0
29 Nov 2023
LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS
Zhiwen Fan
Kevin Wang
Kairun Wen
Zehao Zhu
Dejia Xu
Zhangyang Wang
3DGS
23
180
0
28 Nov 2023
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models
Yuwei Guo
Ceyuan Yang
Anyi Rao
Maneesh Agrawala
Dahua Lin
Bo Dai
DiffM
VGen
18
113
0
28 Nov 2023
TextDiffuser-2: Unleashing the Power of Language Models for Text Rendering
Jingye Chen
Yupan Huang
Tengchao Lv
Lei Cui
Qifeng Chen
Furu Wei
DiffM
17
60
0
28 Nov 2023
PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation
Jiancang Ma
Chen Chen
Qingsong Xie
H. Lu
DiffM
VLM
20
3
0
28 Nov 2023
Text-Driven Image Editing via Learnable Regions
Yuanze Lin
Yi-Wen Chen
Yi-Hsuan Tsai
Lu Jiang
Ming-Hsuan Yang
DiffM
21
16
0
28 Nov 2023
DiffAnt: Diffusion Models for Action Anticipation
Zeyun Zhong
Chengzhi Wu
Manuel Martin
Michael Voit
Juergen Gall
Jürgen Beyerer
DiffM
VGen
15
6
0
27 Nov 2023
LLMGA: Multimodal Large Language Model based Generation Assistant
Bin Xia
Shiyin Wang
Yingfan Tao
Yitong Wang
Jiaya Jia
MLLM
25
12
0
27 Nov 2023
DiffusionMat: Alpha Matting as Sequential Refinement Learning
Yangyang Xu
Shengfeng He
Wenqi Shao
Kwan-Yee K. Wong
Yu Qiao
Ping Luo
DiffM
16
3
0
22 Nov 2023
GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning
Jiaxi Lv
Yi Huang
Mingfu Yan
Jiancheng Huang
Jianzhuang Liu
Yifan Liu
Yafei Wen
Xiaoxin Chen
Shifeng Chen
VGen
DiffM
23
23
0
21 Nov 2023
PatchCraft: Exploring Texture Patch for Efficient AI-generated Image Detection
Nan Zhong
Yiran Xu
Sheng Li
Zhenxing Qian
Xinpeng Zhang
8
25
0
21 Nov 2023
EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models
Ruoxi Chen
Haibo Jin
Yixin Liu
Jinyin Chen
Haohan Wang
Lichao Sun
20
10
0
19 Nov 2023
Mitigating Exposure Bias in Discriminator Guided Diffusion Models
Eleftherios Tsonis
Paraskevi Tzouveli
Athanasios Voulodimos
DiffM
10
2
0
18 Nov 2023
Formulating Discrete Probability Flow Through Optimal Transport
Pengze Zhang
Hubery Yin
Chen Li
Xiaohua Xie
OT
30
5
0
07 Nov 2023
CDGraph: Dual Conditional Social Graph Synthesizing via Diffusion Model
Jui-Yi Tsai
Ya-Wen Teng
Ho Chiok Yew
De-Nian Yang
Lydia Y. Chen
DiffM
13
1
0
03 Nov 2023
Copilot4D: Learning Unsupervised World Models for Autonomous Driving via Discrete Diffusion
Lunjun Zhang
Yuwen Xiong
Ze Yang
Sergio Casas
Rui Hu
R. Urtasun
34
50
0
02 Nov 2023
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Haoxin Chen
Menghan Xia
Yin-Yin He
Yong Zhang
Xiaodong Cun
...
Yaofang Liu
Qifeng Chen
Xintao Wang
Chao-Liang Weng
Ying Shan
DiffM
21
277
0
30 Oct 2023
Customizing 360-Degree Panoramas through Text-to-Image Diffusion Models
Hai Wang
Xiaoyu Xiang
Yuchen Fan
Jing-Hao Xue
93
26
0
28 Oct 2023
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation
Jaemin Cho
Yushi Hu
Roopal Garg
Peter Anderson
Ranjay Krishna
Jason Baldridge
Mohit Bansal
Jordi Pont-Tuset
Su Wang
EGVM
22
66
0
27 Oct 2023
DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Jingxiang Sun
Bo Zhang
Ruizhi Shao
Lizhen Wang
Wen Liu
Zhenda Xie
Yebin Liu
23
132
0
25 Oct 2023
Open Knowledge Base Canonicalization with Multi-task Unlearning
Bingchen Liu
Shihao Hou
Weixin Zeng
Xiang Zhao
Shijun Liu
Li Pan
6
0
0
25 Oct 2023
Local Statistics for Generative Image Detection
Yung Jer Wong
Teck Khim Ng
DiffM
17
2
0
25 Oct 2023
Composer Style-specific Symbolic Music Generation Using Vector Quantized Discrete Diffusion Models
Jincheng Zhang
Jingjing Tang
C. Saitis
Gyorgy Fazekas
DiffM
25
3
0
21 Oct 2023
ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection
Zhongzhan Huang
Pan Zhou
Shuicheng Yan
Liang Lin
13
26
0
20 Oct 2023