ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.14822
  4. Cited By
Vector Quantized Diffusion Model for Text-to-Image Synthesis

Vector Quantized Diffusion Model for Text-to-Image Synthesis

29 November 2021
Shuyang Gu
Dong Chen
Jianmin Bao
Fang Wen
Bo Zhang
Dongdong Chen
Lu Yuan
B. Guo
    DiffM
ArXivPDFHTML

Papers citing "Vector Quantized Diffusion Model for Text-to-Image Synthesis"

50 / 563 papers shown
Title
Incorporating Classifier-Free Guidance in Diffusion Model-Based
  Recommendation
Incorporating Classifier-Free Guidance in Diffusion Model-Based Recommendation
Noah Buchanan
Susan Gauch
Quan Mai
DiffM
VLM
20
1
0
16 Sep 2024
Improving Virtual Try-On with Garment-focused Diffusion Models
Improving Virtual Try-On with Garment-focused Diffusion Models
Siqi Wan
Yehao Li
Jingwen Chen
Yingwei Pan
Ting Yao
Yang Cao
Tao Mei
DiffM
31
0
0
12 Sep 2024
Vector Quantized Diffusion Model Based Speech Bandwidth Extension
Vector Quantized Diffusion Model Based Speech Bandwidth Extension
Yuan Fang
Jinglin Bai
Jiajie Wang
Xueliang Zhang
18
0
0
09 Sep 2024
DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic
  Compensation
DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation
Wenliang Zhao
Haolin Wang
Jie Zhou
Jiwen Lu
DiffM
14
1
0
05 Sep 2024
TrajWeaver: Trajectory Recovery with State Propagation Diffusion Model
TrajWeaver: Trajectory Recovery with State Propagation Diffusion Model
Jinming Wang
Hai Wang
Hongkai Wen
Geyong Min
Man Luo
24
0
0
01 Sep 2024
AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
Zanlin Ni
Yulin Wang
Renping Zhou
Rui Lu
Jiayi Guo
Jinyi Hu
Zhiyuan Liu
Yuan Yao
Gao Huang
25
7
0
31 Aug 2024
One-Shot Learning Meets Depth Diffusion in Multi-Object Videos
One-Shot Learning Meets Depth Diffusion in Multi-Object Videos
Anisha Jain
VGen
DiffM
MDE
24
1
0
29 Aug 2024
Show-o: One Single Transformer to Unify Multimodal Understanding and
  Generation
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation
Jinheng Xie
Weijia Mao
Zechen Bai
David Junhao Zhang
Weihao Wang
Kevin Qinghong Lin
Yuchao Gu
Zhijie Chen
Zhenheng Yang
Mike Zheng Shou
44
159
0
22 Aug 2024
MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework
  for Multimodal Large Language Model
MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model
Chaoya Jiang
Jia Hongrui
Haiyang Xu
Wei Ye
Mengfan Dong
Ming Yan
Ji Zhang
Fei Huang
Shikun Zhang
VLM
43
1
0
22 Aug 2024
Understanding Generative AI Content with Embedding Models
Understanding Generative AI Content with Embedding Models
Max Vargas
Reilly Cannon
A. Engel
Anand D. Sarwate
Tony Chiang
42
3
0
19 Aug 2024
Are CLIP features all you need for Universal Synthetic Image Origin
  Attribution?
Are CLIP features all you need for Universal Synthetic Image Origin Attribution?
Dario Cioni
Christos Tzelepis
Lorenzo Seidenari
Ioannis Patras
35
2
0
17 Aug 2024
LaWa: Using Latent Space for In-Generation Image Watermarking
LaWa: Using Latent Space for In-Generation Image Watermarking
Ahmad Rezaei
Mohammad Akbari
Saeed Ranjbar Alvar
Arezou Fatemi
Yong Zhang
WIGM
27
13
0
11 Aug 2024
D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion
  Methods
D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods
Onkar Susladkar
Gayatri S Deshmukh
Sparsh Mittal
Parth Shastri
DiffM
31
2
0
07 Aug 2024
Informed Correctors for Discrete Diffusion Models
Informed Correctors for Discrete Diffusion Models
Yixiu Zhao
Jiaxin Shi
Lester W. Mackey
Scott W. Linderman
Lester Mackey
Scott Linderman
37
9
0
30 Jul 2024
Learning Trimodal Relation for AVQA with Missing Modality
Learning Trimodal Relation for AVQA with Missing Modality
Kyu Ri Park
Hong Joo Lee
Jung Uk Kim
29
1
0
23 Jul 2024
LSReGen: Large-Scale Regional Generator via Backward Guidance Framework
LSReGen: Large-Scale Regional Generator via Backward Guidance Framework
Bowen Zhang
Cheng Yang
Xuanhui Liu
DiffM
14
0
0
21 Jul 2024
M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models
M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models
Seung-geun Chi
Hyung-Gun Chi
Hengbo Ma
Nakul Agarwal
Faizan Siddiqui
Karthik Ramani
Kwonjoon Lee
DiffM
46
10
0
19 Jul 2024
LTSim: Layout Transportation-based Similarity Measure for Evaluating
  Layout Generation
LTSim: Layout Transportation-based Similarity Measure for Evaluating Layout Generation
Mayu Otani
Naoto Inoue
Kotaro Kikuchi
Riku Togashi
3DV
21
4
0
17 Jul 2024
Quantised Global Autoencoder: A Holistic Approach to Representing Visual
  Data
Quantised Global Autoencoder: A Holistic Approach to Representing Visual Data
Tim Elsner
Paula Usinger
Victor Czech
Gregor Kobsik
Yanjiang He
I. Lim
Leif Kobbelt
34
0
0
16 Jul 2024
Length-Aware Motion Synthesis via Latent Diffusion
Length-Aware Motion Synthesis via Latent Diffusion
Alessio Sampieri
Alessio Palma
Indro Spinelli
Fabio Galasso
VGen
DiffM
32
5
0
16 Jul 2024
Surgical Text-to-Image Generation
Surgical Text-to-Image Generation
C. Nwoye
Rupak Bose
K. Elgohary
Lorenzo Arboit
Giorgio Carlino
Joël L. Lavanchy
Pietro Mascagni
N. Padoy
MedIm
55
3
0
12 Jul 2024
Several questions of visual generation in 2024
Several questions of visual generation in 2024
Shuyang Gu
22
1
0
11 Jul 2024
InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with
  Semantic Graph Prior
InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph Prior
Chenguo Lin
Yuchen Lin
Panwang Pan
Xuanyang Zhang
Yadong Mu
3DV
44
1
0
10 Jul 2024
RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
Bowen Zhang
Yiji Cheng
Chunyu Wang
Ting Zhang
Jiaolong Yang
Yansong Tang
Feng Zhao
Dong Chen
Baining Guo
DiffM
35
18
0
09 Jul 2024
Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene
  Synthesis
Forest2Seq: Revitalizing Order Prior for Sequential Indoor Scene Synthesis
Qi Sun
Hang Zhou
Wengang Zhou
Li Li
Houqiang Li
3DPC
3DV
34
6
0
07 Jul 2024
TimeLDM: Latent Diffusion Model for Unconditional Time Series Generation
TimeLDM: Latent Diffusion Model for Unconditional Time Series Generation
Jian Qian
Miao Sun
Sifan Zhou
Biao Wan
Minhao Li
Patrick Chiang
28
7
0
05 Jul 2024
Timestep-Aware Correction for Quantized Diffusion Models
Timestep-Aware Correction for Quantized Diffusion Models
Yuzhe Yao
Feng Tian
Jun Chen
Haonan Lin
Guang Dai
Yong Liu
Jingdong Wang
DiffM
MQ
33
4
0
04 Jul 2024
Improved Noise Schedule for Diffusion Training
Improved Noise Schedule for Diffusion Training
Tiankai Hang
Shuyang Gu
DiffM
16
10
0
03 Jul 2024
GVDIFF: Grounded Text-to-Video Generation with Diffusion Models
GVDIFF: Grounded Text-to-Video Generation with Diffusion Models
Huanzhang Dou
Ruixiang Li
Wei Su
Xi Li
DiffM
31
1
0
02 Jul 2024
Enhancing Multi-Class Anomaly Detection via Diffusion Refinement with
  Dual Conditioning
Enhancing Multi-Class Anomaly Detection via Diffusion Refinement with Dual Conditioning
Jiawei Zhan
Jinxiang Lai
Bin-Bin Gao
Jun Liu
Xiaochen Chen
Chengjie Wang
27
1
0
02 Jul 2024
DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models
DiffIR2VR-Zero: Zero-Shot Video Restoration with Diffusion-based Image Restoration Models
Chang-Han Yeh
Chin-Yang Lin
Zhixiang Wang
Chi-Wei Hsiao
Ting-Hsuan Chen
Hau-Shiang Shiu
Yu-Lun Liu
VGen
DiffM
54
5
0
01 Jul 2024
What Matters in Detecting AI-Generated Videos like Sora?
What Matters in Detecting AI-Generated Videos like Sora?
Chirui Chang
Zhengzhe Liu
Xiaoyang Lyu
Xiaojuan Qi
DiffM
VGen
85
6
0
27 Jun 2024
A Sanity Check for AI-generated Image Detection
A Sanity Check for AI-generated Image Detection
Shilin Yan
Ouxiang Li
Jiayin Cai
Y. Hao
Xiaolong Jiang
Yao Hu
Weidi Xie
VLM
56
19
0
27 Jun 2024
MultiDiff: Consistent Novel View Synthesis from a Single Image
MultiDiff: Consistent Novel View Synthesis from a Single Image
Norman Muller
Katja Schwarz
Barbara Roessle
Lorenzo Porzi
Samuel Rota Buló
Matthias Nießner
Peter Kontschieder
DiffM
34
22
0
26 Jun 2024
Diffusion Model-Based Video Editing: A Survey
Diffusion Model-Based Video Editing: A Survey
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Dacheng Tao
VGen
55
20
0
26 Jun 2024
GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization
GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization
Y. Chen
X. Huang
Quan Zhang
Wei Li
Mingjian Zhu
...
Hanting Chen
Hailin Hu
J. Yang
W. Liu
Jie Hu
EGVM
51
1
0
24 Jun 2024
Neural Residual Diffusion Models for Deep Scalable Vision Generation
Neural Residual Diffusion Models for Deep Scalable Vision Generation
Zhiyuan Ma
Liangliang Zhao
Biqing Qi
Bowen Zhou
DiffM
53
2
0
19 Jun 2024
Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of
  99%
Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99%
Lei Zhu
Fangyun Wei
Yanye Lu
Dong Chen
VLM
30
31
0
17 Jun 2024
An Analysis on Quantizing Diffusion Transformers
An Analysis on Quantizing Diffusion Transformers
Yuewei Yang
Jialiang Wang
Xiaoliang Dai
Peizhao Zhang
Hongbo Zhang
MQ
29
1
0
16 Jun 2024
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis:
  Techniques for Portrait Generation, Driving Mechanisms, and Editing
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
Ming Meng
Yufei Zhao
Bo Zhang
Yonggui Zhu
Weimin Shi
Maxwell Wen
Zhaoxin Fan
VGen
34
1
0
15 Jun 2024
Rethinking Score Distillation as a Bridge Between Image Distributions
Rethinking Score Distillation as a Bridge Between Image Distributions
David McAllister
Songwei Ge
Jia-Bin Huang
David W. Jacobs
Alexei A. Efros
Aleksander Holyñski
Angjoo Kanazawa
DiffM
54
14
0
13 Jun 2024
Real-Time Deepfake Detection in the Real-World
Real-Time Deepfake Detection in the Real-World
Bar Cavia
Eliahu Horwitz
Tal Reiss
Yedid Hoshen
35
4
0
13 Jun 2024
PaRa: Personalizing Text-to-Image Diffusion via Parameter Rank Reduction
PaRa: Personalizing Text-to-Image Diffusion via Parameter Rank Reduction
Shangyu Chen
Zizheng Pan
Jianfei Cai
Dinh Q. Phung
37
0
0
09 Jun 2024
Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis
Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis
Zanlin Ni
Yulin Wang
Renping Zhou
Jiayi Guo
Jinyi Hu
Zhiyuan Liu
Shiji Song
Yuan Yao
Gao Huang
27
14
0
08 Jun 2024
MotionClone: Training-Free Motion Cloning for Controllable Video
  Generation
MotionClone: Training-Free Motion Cloning for Controllable Video Generation
Pengyang Ling
Jiazi Bu
Pan Zhang
Xiaoyi Dong
Yuhang Zang
Tong Wu
H. Chen
Jiaqi Wang
Yi Jin
VGen
DiffM
26
34
0
08 Jun 2024
TexIm FAST: Text-to-Image Representation for Semantic Similarity
  Evaluation using Transformers
TexIm FAST: Text-to-Image Representation for Semantic Similarity Evaluation using Transformers
Wazib Ansar
Saptarsi Goswami
Amlan Chakrabarti
ViT
19
0
0
06 Jun 2024
Searching Priors Makes Text-to-Video Synthesis Better
Searching Priors Makes Text-to-Video Synthesis Better
Haoran Cheng
Liang Peng
Linxuan Xia
Yuepeng Hu
Hengjia Li
Qinglin Lu
Xiaofei He
Boxi Wu
VGen
DiffM
23
0
0
05 Jun 2024
GraVITON: Graph based garment warping with attention guided inversion
  for Virtual-tryon
GraVITON: Graph based garment warping with attention guided inversion for Virtual-tryon
Sanhita Pathak
V. Kaushik
Brejesh Lall
DiffM
23
0
0
04 Jun 2024
MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
MoLA: Motion Generation and Editing with Latent Diffusion Enhanced by Adversarial Training
Kengo Uchida
Takashi Shibuya
Yuhta Takida
Naoki Murata
Shusuke Takahashi
Shusuke Takahashi
Yuki Mitsufuji
VGen
44
4
0
04 Jun 2024
Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Qilong Zhangli
Jindong Jiang
Di Liu
Licheng Yu
Xiaoliang Dai
Ankit Ramchandani
Guan Pang
Dimitris N. Metaxas
Praveen Krishnan
DiffM
43
8
0
03 Jun 2024
Previous
123456...101112
Next