Vector Quantized Diffusion Model for Text-to-Image Synthesis


29 November 2021
Shuyang Gu
Dong Chen
Jianmin Bao
Fang Wen
Bo Zhang
Dongdong Chen
Lu Yuan
B. Guo
    DiffM

Papers citing "Vector Quantized Diffusion Model for Text-to-Image Synthesis"

50 / 563 papers shown
Artifact Feature Purification for Cross-domain Detection of AI-generated Images
Zheling Meng
Bo Peng
Jing Dong
Tieniu Tan
81
2
0
17 Mar 2024
Codebook Transfer with Part-of-Speech for Vector-Quantized Image Modeling
Baoquan Zhang
Huaibin Wang
Chuyao Luo
Xutao Li
Guotao Liang
Yunming Ye
Xiaochen Qi
Yao He
32
11
0
15 Mar 2024
UniCode: Learning a Unified Codebook for Multimodal Large Language Models
Sipeng Zheng
Bohan Zhou
Yicheng Feng
Ye Wang
Zongqing Lu
VLM
MLLM
26
7
0
14 Mar 2024
Data-Independent Operator: A Training-Free Artifact Representation Extractor for Generalizable Deepfake Detection
Chuangchuang Tan
Ping Liu
Renshuai Tao
Huan Liu
Yao-Min Zhao
Baoyuan Wu
Yunchao Wei
26
9
0
11 Mar 2024
DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation
Xiaobin Hu
Xu Peng
Donghao Luo
Xiaozhong Ji
Jinlong Peng
Zhengkai Jiang
Jiangning Zhang
Taisong Jin
Chengjie Wang
Rongrong Ji
DiffM
24
4
0
10 Mar 2024
Towards In-Vehicle Multi-Task Facial Attribute Recognition: Investigating Synthetic Data and Vision Foundation Models
Esmaeil Seraj
Walter Talamonti
27
0
0
10 Mar 2024
StableDrag: Stable Dragging for Point-based Image Editing
Yutao Cui
Xiaotong Zhao
Guozhen Zhang
Shengming Cao
Kai Ma
Limin Wang
33
10
0
07 Mar 2024
Deep-Learned Compression for Radio-Frequency Signal Classification
Armani Rodriguez
Yagna Kaasaragadda
S. Kokalj-Filipovic
21
1
0
05 Mar 2024
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
Zeqian Ju
Yuancheng Wang
Kai Shen
Xu Tan
Detai Xin
...
Shikun Zhang
Jiang Bian
Lei He
Jinyu Li
Sheng Zhao
DiffM
28
143
0
05 Mar 2024
HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
Supreeth Narasimhaswamy
Uttaran Bhattacharya
Xiang Chen
Ishita Dasgupta
Saayan Mitra
Minh Hoai
DiffM
24
23
0
04 Mar 2024
DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction
Jun Xiong
Peng Zhang
Tao You
Chuanyue Li
Wei Huang
Yufei Zha
DiffM
21
5
0
02 Mar 2024
Text-guided Explorable Image Super-resolution
Kanchana Vaishnavi Gandikota
Paramanand Chandramouli
40
7
0
02 Mar 2024
Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning
Xiaoyu Zhang
Matthew Chang
Pranav Kumar
Saurabh Gupta
DiffM
OffRL
43
13
0
27 Feb 2024
DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Models
Shyam Marjit
Harshit Singh
Nityanand Mathur
Sayak Paul
Chia-Mu Yu
Pin-Yu Chen
DiffM
25
6
0
27 Feb 2024
Diffusion Model-Based Image Editing: A Survey
Yi Huang
Jiancheng Huang
Yifan Liu
Mingfu Yan
Jiaxi Lv
Jianzhuang Liu
Wei Xiong
He Zhang
Liangliang Cao
EGVM
66
84
0
27 Feb 2024
Generative AI in Vision: A Survey on Models, Metrics and Applications
Gaurav Raut
Apoorv Singh
VLM
MedIm
36
6
0
26 Feb 2024
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition
Chun-Hsiao Yeh
Ta-Ying Cheng
He-Yen Hsieh
Chuan-En Lin
Yi Ma
Andrew Markham
Niki Trigoni
H. T. Kung
Yubei Chen
DiffM
25
3
0
23 Feb 2024
Hierarchical Invariance for Robust and Interpretable Vision Tasks at Larger Scales
Shuren Qi
Yushu Zhang
Chao Wang
Zhihua Xia
Xiaochun Cao
Jian Weng
16
1
0
23 Feb 2024
Human Video Translation via Query Warping
Haiming Zhu
Yangyang Xu
Shengfeng He
DiffM
27
0
0
19 Feb 2024
ComFusion: Personalized Subject Generation in Multiple Specific Scenes From Single Image
Yan Hong
Jianfu Zhang
DiffM
20
3
0
19 Feb 2024
WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection
Yan Hong
Jianfu Zhang
67
9
0
19 Feb 2024
Visual Concept-driven Image Generation with Text-to-Image Diffusion Model
Tanzila Rahman
Shweta Mahajan
Hsin-Ying Lee
Jian Ren
Sergey Tulyakov
Leonid Sigal
80
4
0
18 Feb 2024
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
Lanqing Guo
Yin-Yin He
Haoxin Chen
Menghan Xia
Xiaodong Cun
...
Yong Zhang
Xintao Wang
Qifeng Chen
Ying Shan
Bihan Wen
24
23
0
16 Feb 2024
GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting
Chen Yang
Sikuang Li
Jiemin Fang
Ruofan Liang
Lingxi Xie
Xiaopeng Zhang
Wei Shen
Qi Tian
3DGS
17
19
0
15 Feb 2024
Quantized Embedding Vectors for Controllable Diffusion Language Models
Cheng Kang
Xinye Chen
Yong Hu
Daniel Novak
18
0
0
15 Feb 2024
Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation
Junjie Shentu
Matthew Watson
Noura Al Moubayed
15
0
0
15 Feb 2024
Trustworthy SR: Resolving Ambiguity in Image Super-resolution via Diffusion Models and Human Feedback
Cansu Korkmaz
Ege Çirakman
A. Murat Tekalp
Zafer Doğan
21
0
0
12 Feb 2024
Diff-RNTraj: A Structure-aware Diffusion Model for Road Network-constrained Trajectory Generation
Tonglong Wei
Youfang Lin
S. Guo
Yan Lin
Yiheng Huang
Chenyang Xiang
Yuqing Bai
Menglu Ya
Huaiyu Wan
28
11
0
12 Feb 2024
Scalable Diffusion Models with State Space Backbone
Zhengcong Fei
Mingyuan Fan
Changqian Yu
Junshi Huang
62
33
0
08 Feb 2024
Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints
Jian Chen
Ruiyi Zhang
Yufan Zhou
Rajiv Jain
Zhiqiang Xu
Ryan A. Rossi
Changyou Chen
DiffM
42
12
0
07 Feb 2024
InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior
Chenguo Lin
Yadong Mu
3DV
14
32
0
07 Feb 2024
DisDet: Exploring Detectability of Backdoor Attack on Diffusion Models
Yang Sui
Huy Phan
Jinqi Xiao
Tian-Di Zhang
Zijie Tang
Cong Shi
Yan Wang
Yingying Chen
Bo Yuan
DiffM
AAML
16
12
0
05 Feb 2024
Separable Multi-Concept Erasure from Diffusion Models
Mengnan Zhao
Lihe Zhang
Tianhang Zheng
Yuqiu Kong
Baocai Yin
41
9
0
03 Feb 2024
A Single Simple Patch is All You Need for AI-generated Image Detection
Jiaxuan Chen
Jieteng Yao
Li Niu
11
22
0
02 Feb 2024
Diffusion Facial Forgery Detection
Harry Cheng
Yangyang Guo
Tianyi Wang
L. Nie
Mohan S. Kankanhalli
56
16
0
29 Jan 2024
CCA: Collaborative Competitive Agents for Image Editing
Tiankai Hang
Shuyang Gu
Dong Chen
Xin Geng
Baining Guo
20
5
0
23 Jan 2024
Detecting Multimedia Generated by Large AI Models: A Survey
Li Lin
Neeraj Gupta
Yue Zhang
Hainan Ren
Chun-Hao Liu
Feng Ding
Xin Eric Wang
X. Li
Luisa Verdoliva
Shu Hu
75
56
0
22 Jan 2024
Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior
Zike Wu
Pan Zhou
Xuanyu Yi
Xiaoding Yuan
Hanwang Zhang
DiffM
23
36
0
17 Jan 2024
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Haoxin Chen
Yong Zhang
Xiaodong Cun
Menghan Xia
Xintao Wang
Chao-Liang Weng
Ying Shan
VGen
DiffM
115
274
0
17 Jan 2024
Revealing Vulnerabilities in Stable Diffusion via Targeted Attacks
Chenyu Zhang
Lanjun Wang
Anan Liu
24
6
0
16 Jan 2024
Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization
Chongzhi Zhang
Mingyuan Zhang
Zhiyang Teng
Jiayi Li
Xizhou Zhu
Lewei Lu
Ziwei Liu
Aixin Sun
DiffM
VGen
13
0
0
16 Jan 2024
Improving Diffusion-Based Image Synthesis with Context Prediction
Ling Yang
Jingwei Liu
Shenda Hong
Zhilong Zhang
Zhilin Huang
Zheming Cai
Wentao Zhang
Bin Cui
DiffM
38
33
0
04 Jan 2024
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes
Yuhta Takida
Yukara Ikemiya
Takashi Shibuya
Kazuki Shimada
Woosung Choi
...
Naoki Murata
Toshimitsu Uesaka
Kengo Uchida
Wei-Hsiang Liao
Yuki Mitsufuji
BDL
30
11
0
31 Dec 2023
FlashVideo: A Framework for Swift Inference in Text-to-Video Generation
Bin Lei
Le Chen
Caiwen Ding
VGen
20
1
0
30 Dec 2023
Classifier-free graph diffusion for molecular property targeting
Matteo Ninniri
Marco Podda
Davide Bacciu
30
5
0
28 Dec 2023
Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection
Huan Liu
Zichang Tan
Chuangchuang Tan
Yunchao Wei
Yao-Min Zhao
Jingdong Wang
ViT
26
42
0
27 Dec 2023
Towards Flexible, Scalable, and Adaptive Multi-Modal Conditioned Face Synthesis
Jingjing Ren
Cheng Xu
Haoyu Chen
Xinran Qin
Lei Zhu
CVBM
DiffM
24
4
0
26 Dec 2023
Emage: Non-Autoregressive Text-to-Image Generation
Zhangyin Feng
Runyi Hu
Liangxin Liu
Fan Zhang
Duyu Tang
Yong Dai
Xiaocheng Feng
Jiwei Li
Bing Qin
Shuming Shi
DiffM
VLM
14
0
0
22 Dec 2023
Generative AI Beyond LLMs: System Implications of Multi-Modal Generation
Alicia Golden
Samuel Hsia
Fei Sun
Bilge Acun
Basil Hosmer
...
Zachary DeVito
Jeff Johnson
Gu-Yeon Wei
David Brooks
Carole-Jean Wu
VLM
DiffM
22
8
0
22 Dec 2023
Diffusion Reward: Learning Rewards via Conditional Video Diffusion
Tao Huang
Guangqi Jiang
Yanjie Ze
Huazhe Xu
VGen
26
22
0
21 Dec 2023