Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.08332
Cited By
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model
15 November 2022
Xingqian Xu
Zhangyang Wang
Eric Zhang
Kai Wang
Humphrey Shi
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Versatile Diffusion: Text, Images and Variations All in One Diffusion Model"
39 / 139 papers shown
Title
UniBrain: Unify Image Reconstruction and Captioning All in One Diffusion Model from Human Brain Activity
Weijian Mai
Zhijun Zhang
DiffM
14
31
0
14 Aug 2023
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
Hu Ye
Jun Zhang
Siyi Liu
Xiao Han
Wei Yang
DiffM
21
725
0
13 Aug 2023
Circumventing Concept Erasure Methods For Text-to-Image Generative Models
Minh Pham
Kelly O. Marshall
Niv Cohen
Govind Mittal
C. Hegde
DiffM
14
37
0
03 Aug 2023
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation
Yasheng Sun
Yifan Yang
Houwen Peng
Yifei Shen
Yuqing Yang
Hang-Rui Hu
Lili Qiu
Hideki Koike
DiffM
LM&Ro
27
33
0
02 Aug 2023
Reference-based Painterly Inpainting via Diffusion: Crossing the Wild Reference Domain Gap
Dejia Xu
Xingqian Xu
Wenyan Cong
Humphrey Shi
Zhangyang Wang
DiffM
21
4
0
20 Jul 2023
FreeDrag: Feature Dragging for Reliable Point-based Image Editing
Pengyang Ling
Lin Chen
Pan Zhang
H. Chen
Yi Jin
Jinjin Zheng
DiffM
21
13
0
10 Jul 2023
JourneyDB: A Benchmark for Generative Image Understanding
Keqiang Sun
Junting Pan
Yuying Ge
Hao Li
Haodong Duan
...
Yi Wang
Jifeng Dai
Yu Qiao
Limin Wang
Hongsheng Li
31
100
0
03 Jul 2023
DisCo: Disentangled Control for Realistic Human Dance Generation
Tan Wang
Linjie Li
Kevin Qinghong Lin
Yuanhao Zhai
Chung-Ching Lin
Zhengyuan Yang
Hanwang Zhang
Zicheng Liu
Lijuan Wang
VGen
13
70
0
30 Jun 2023
Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
Xiaoshi Wu
Yiming Hao
Keqiang Sun
Yixiong Chen
Feng Zhu
Rui Zhao
Hongsheng Li
14
251
0
15 Jun 2023
Generative Semantic Communication: Diffusion Models Beyond Bit Recovery
Eleonora Grassucci
Sergio Barbarossa
Danilo Comminiello
DiffM
17
53
0
07 Jun 2023
RealignDiff: Boosting Text-to-Image Diffusion Model with Coarse-to-fine Semantic Re-alignment
Guian Fang
Zutao Jiang
Jianhua Han
Guangsong Lu
Hang Xu
Shengcai Liao
Xiaodan Liang
EGVM
19
1
0
31 May 2023
Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors
Paul S. Scotti
Atmadeep Banerjee
J. Goode
Stepan Shabalin
A. Nguyen
...
Nathalie Verlinde
Elad Yundler
David Weisberg
K. A. Norman
Tanishq Mathew Abraham
DiffM
21
105
0
29 May 2023
On Evaluating Adversarial Robustness of Large Vision-Language Models
Yunqing Zhao
Tianyu Pang
Chao Du
Xiao Yang
Chongxuan Li
Ngai-man Cheung
Min-Bin Lin
VLM
AAML
MLLM
6
166
0
26 May 2023
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models
Xingqian Xu
Jiayi Guo
Zhangyang Wang
Gao Huang
Irfan Essa
Humphrey Shi
VLM
DiffM
25
57
0
25 May 2023
MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation
Marco Bellagente
Manuel Brack
H. Teufel
Felix Friedrich
Bjorn Deiseroth
...
Koen Oostermeijer
Andres Felipe Cruz Salinas
P. Schramowski
Kristian Kersting
Samuel Weinbach
33
15
0
24 May 2023
Any-to-Any Generation via Composable Diffusion
Zineng Tang
Ziyi Yang
Chenguang Zhu
Michael Zeng
Mohit Bansal
VGen
DiffM
18
169
0
19 May 2023
In-Context Learning Unlocked for Diffusion Models
Zhendong Wang
Yifan Jiang
Yadong Lu
Yelong Shen
Pengcheng He
Weizhu Chen
Zhangyang Wang
Mingyuan Zhou
VLM
DiffM
86
68
0
01 May 2023
ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation
Jiazheng Xu
Xiao Liu
Yuchen Wu
Yuxuan Tong
Qinkai Li
Ming Ding
Jie Tang
Yuxiao Dong
15
306
0
12 Apr 2023
Zero-shot Generative Model Adaptation via Image-specific Prompt Learning
Jiayi Guo
Chaofei Wang
You Wu
Eric Zhang
Kai Wang
Xingqian Xu
S. Song
Humphrey Shi
Gao Huang
DiffM
VLM
66
28
0
06 Apr 2023
Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models
Eric Zhang
Kai Wang
Xingqian Xu
Zhangyang Wang
Humphrey Shi
DiffM
42
169
0
30 Mar 2023
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor
Vidit Goel
E. Peruzzo
Yifan Jiang
Dejia Xu
Xingqian Xu
N. Sebe
Trevor Darrell
Zhangyang Wang
Humphrey Shi
DiffM
12
6
0
30 Mar 2023
CoBIT: A Contrastive Bi-directional Image-Text Generation Model
Haoxuan You
Mandy Guo
Zhecan Wang
Kai-Wei Chang
Jason Baldridge
Jiahui Yu
DiffM
37
12
0
23 Mar 2023
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
Levon Khachatryan
A. Movsisyan
Vahram Tadevosyan
Roberto Henschel
Zhangyang Wang
Shant Navasardyan
Humphrey Shi
VGen
16
520
0
23 Mar 2023
On the De-duplication of LAION-2B
Ryan Webster
Julien Rabin
Loïc Simon
F. Jurie
DiffM
10
40
0
17 Mar 2023
Text-to-image Diffusion Models in Generative AI: A Survey
Chenshuang Zhang
Chaoning Zhang
Mengchun Zhang
In So Kweon
VLM
42
263
0
14 Mar 2023
Diffusion Models for Non-autoregressive Text Generation: A Survey
Yifan Li
Kun Zhou
Wayne Xin Zhao
Ji-Rong Wen
MedIm
DiffM
37
31
0
12 Mar 2023
One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale
Fan Bao
Shen Nie
Kaiwen Xue
Chongxuan Li
Shiliang Pu
Yaole Wang
Gang Yue
Yue Cao
Hang Su
Jun Zhu
DiffM
199
147
0
12 Mar 2023
Natural scene reconstruction from fMRI signals using generative latent diffusion
Furkan Ozcelik
Rufin VanRullen
DiffM
90
42
0
09 Mar 2023
Leaving Reality to Imagination: Robust Classification via Generated Datasets
Hritik Bansal
Aditya Grover
OOD
34
86
0
05 Feb 2023
Do DALL-E and Flamingo Understand Each Other?
Hang Li
Jindong Gu
Rajat Koner
Sahand Sharifzadeh
Volker Tresp
MLLM
16
12
0
23 Dec 2022
A Survey on Generative Diffusion Model
Hanqun Cao
Cheng Tan
Zhangyang Gao
Yilun Xu
Guangyong Chen
Pheng-Ann Heng
Stan Z. Li
MedIm
37
195
0
06 Sep 2022
Diffusion Models: A Comprehensive Survey of Methods and Applications
Ling Yang
Zhilong Zhang
Yingxia Shao
Shenda Hong
Runsheng Xu
Yue Zhao
Wentao Zhang
Bin Cui
Ming-Hsuan Yang
DiffM
MedIm
215
1,277
0
02 Sep 2022
A survey of multimodal deep generative models
Masahiro Suzuki
Y. Matsuo
SyDa
DRL
43
75
0
05 Jul 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
388
4,010
0
28 Jan 2022
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,735
0
24 Feb 2021
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Soravit Changpinyo
P. Sharma
Nan Ding
Radu Soricut
VLM
273
1,077
0
17 Feb 2021
Image-to-Image Translation with Conditional Adversarial Networks
Phillip Isola
Jun-Yan Zhu
Tinghui Zhou
Alexei A. Efros
SSeg
212
19,191
0
21 Nov 2016
Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Akira Fukui
Dong Huk Park
Daylen Yang
Anna Rohrbach
Trevor Darrell
Marcus Rohrbach
144
1,458
0
06 Jun 2016
Pixel Recurrent Neural Networks
Aaron van den Oord
Nal Kalchbrenner
Koray Kavukcuoglu
SSeg
GAN
225
2,542
0
25 Jan 2016
Previous
1
2
3