2112.10752
Cited By
High-Resolution Image Synthesis with Latent Diffusion Models
20 December 2021
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Björn Ommer
3DV
Papers citing
"High-Resolution Image Synthesis with Latent Diffusion Models"
50 / 7,956 papers shown
Title
MagicEdit: High-Fidelity and Temporally Coherent Video Editing
Jun Hao Liew
Hanshu Yan
Jianfeng Zhang
Zhongcong Xu
Jiashi Feng
VGen
DiffM
17
52
0
28 Aug 2023
Total Selfie: Generating Full-Body Selfies
B. Chen
Brian L. Curless
Ira Kemelmacher-Shlizerman
S. M. Seitz
DiffM
34
4
0
28 Aug 2023
LatentDR: Improving Model Generalization Through Sample-Aware Latent Degradation and Restoration
Ran Liu
Sahil Khose
Jingyun Xiao
Lakshmi Sathidevi
Keerthan Ramnath
Z. Kira
Eva L. Dyer
19
3
0
28 Aug 2023
Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization
Tao Yang
Rongyuan Wu
Peiran Ren
Xuansong Xie
Lei Zhang
DiffM
34
136
0
28 Aug 2023
ExpCLIP: Bridging Text and Facial Expressions via Semantic Alignment
Yicheng Zhong
Huawei Wei
Pei-Yin Yang
Zhisheng Wang
CLIP
19
6
0
28 Aug 2023
Mobile Foundation Model as Firmware
Jinliang Yuan
Chenchen Yang
Dongqi Cai
Shihe Wang
Xin Yuan
...
Di Zhang
Hanzi Mei
Xianqing Jia
Shangguang Wang
Mengwei Xu
32
19
0
28 Aug 2023
HoloFusion: Towards Photo-realistic 3D Generative Modeling
Animesh Karnewar
Niloy J. Mitra
Andrea Vedaldi
David Novotny
DiffM
26
31
0
28 Aug 2023
Spoken Language Intelligence of Large Language Models for Language Learning
Linkai Peng
Baorian Nuchged
Yingming Gao
ELM
57
4
0
28 Aug 2023
SketchDreamer: Interactive Text-Augmented Creative Sketch Ideation
Zhiyu Qu
Tao Xiang
Yi-Zhe Song
DiffM
34
11
0
27 Aug 2023
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
Lin Geng Foo
Hossein Rahmani
J. Liu
62
31
0
27 Aug 2023
Sparse3D: Distilling Multiview-Consistent Diffusion for Object Reconstruction from Sparse Views
Zi-Xin Zou
Weihao Cheng
Yan-Pei Cao
Shi-Sheng Huang
Ying Shan
Song-Hai Zhang
DiffM
19
23
0
27 Aug 2023
CHORUS: Learning Canonicalized 3D Human-Object Spatial Relations from Unbounded Synthesized Images
Sookwan Han
Hanbyul Joo
14
14
0
23 Aug 2023
Manipulating Embeddings of Stable Diffusion Prompts
Niklas Deckers
Julia Peters
Martin Potthast
DiffM
32
9
0
23 Aug 2023
Boosting Diffusion Models with an Adaptive Momentum Sampler
Xiyu Wang
Anh-Dung Dinh
Daochang Liu
Chang Xu
13
4
0
23 Aug 2023
Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning
Jiasheng Ye
Zaixiang Zheng
Yu Bao
Lihua Qian
Quanquan Gu
DiffM
52
14
0
23 Aug 2023
Hey That's Mine Imperceptible Watermarks are Preserved in Diffusion Generated Outputs
Luke Ditria
Tom Drummond
WIGM
19
2
0
22 Aug 2023
The DKU-DUKEECE System for the Manipulation Region Location Task of ADD 2023
Zexin Cai
Weiqing Wang
Yikang Wang
Ming Li
17
6
0
20 Aug 2023
Make-It-4D: Synthesizing a Consistent Long-Term Dynamic Scene Video from a Single Image
Liao Shen
Xingyi Li
Huiqiang Sun
Juewen Peng
Ke Xian
Zhiguo Cao
Guo-Shing Lin
DiffM
27
14
0
20 Aug 2023
MeDM: Mediating Image Diffusion Models for Video-to-Video Translation with Temporal Correspondence Guidance
Ernie Chu
Tzu-Hua Huang
Shuohao Lin
Jun-Cheng Chen
DiffM
VGen
19
13
0
19 Aug 2023
AltDiffusion: A Multilingual Text-to-Image Diffusion Model
Fulong Ye
Guangyi Liu
Xinya Wu
Ledell Yu Wu
VLM
27
25
0
19 Aug 2023
Learning Representations on Logs for AIOps
Pranjal Gupta
Harshit Kumar
Debanjana Kar
Karan Bhukar
Pooja Aggarwal
P. Mohapatra
20
11
0
18 Aug 2023
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Ouyang Hao
Qiuyu Wang
Yuxi Xiao
Qingyan Bai
Juntao Zhang
Kecheng Zheng
Xiaowei Zhou
Qifeng Chen
Yujun Shen
DiffM
VGen
41
81
0
15 Aug 2023
Link-Context Learning for Multimodal LLMs
Yan Tai
Weichen Fan
Zhao Zhang
Feng Zhu
Rui Zhao
Ziwei Liu
ReLM
LRM
21
17
0
15 Aug 2023
StyleDiffusion: Controllable Disentangled Style Transfer via Diffusion Models
Zhizhong Wang
Lei Zhao
Wei Xing
DiffM
27
119
0
15 Aug 2023
Steering Language Generation: Harnessing Contrastive Expert Guidance and Negative Prompting for Coherent and Diverse Synthetic Data Generation
Charles O'Neill
Y. Ting 丁
I. Ciucă
Jack Miller
Thang Bui
SyDa
29
1
0
15 Aug 2023
MarkovGen: Structured Prediction for Efficient Text-to-Image Generation
Sadeep Jayasumana
Daniel Glasner
Srikumar Ramalingam
Andreas Veit
Ayan Chakrabarti
Surinder Kumar
DiffM
19
0
0
14 Aug 2023
FocusFlow: Boosting Key-Points Optical Flow Estimation for Autonomous Driving
Zhonghua Yi
Haowen Shi
Kailun Yang
Qi Jiang
Yaozu Ye
Ze Wang
Huajian Ni
Kaiwei Wang
3DPC
15
9
0
14 Aug 2023
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer
Xiaofei Wang
Manthan Thakker
Zhuo Chen
Naoyuki Kanda
Sefik Emre Eskimez
Sanyuan Chen
M. Tang
Shujie Liu
Jinyu Li
Takuya Yoshioka
18
79
0
14 Aug 2023
Precipitation nowcasting with generative diffusion models
Andrea Asperti
Fabio Merizzi
Alberto Paparella
G. Pedrazzi
M. Angelinelli
Stefano Colamonaco
DiffM
25
18
0
13 Aug 2023
LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts
Binbin Yang
Yinzheng Luo
Ziliang Chen
Guangrun Wang
Xiaodan Liang
Liang Lin
DiffM
19
12
0
13 Aug 2023
Large Language Models and Foundation Models in Smart Agriculture: Basics, Opportunities, and Challenges
Jiajia Li
Mingle Xu
Lirong Xiang
Dong Chen
Weichao Zhuang
Xunyuan Yin
Zhao Li
25
3
0
13 Aug 2023
TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image Super-Resolution
Baolin Liu
Zongyuan Yang
Pengfei Wang
Junjie Zhou
Ziqi Liu
Ziyi Song
Yan Liu
Yongping Xiong
26
7
0
13 Aug 2023
VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use
Yonatan Bitton
Hritik Bansal
Jack Hessel
Rulin Shao
Wanrong Zhu
Anas Awadalla
Josh Gardner
Rohan Taori
L. Schmidt
VLM
29
77
0
12 Aug 2023
White-box Membership Inference Attacks against Diffusion Models
Yan Pang
Tianhao Wang
Xu Kang
Mengdi Huai
Yang Zhang
AAML
DiffM
31
22
0
11 Aug 2023
Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation
Yuki Endo
25
8
0
11 Aug 2023
Illumination and Shadows in Head Rotation: experiments with Denoising Diffusion Models
Andrea Asperti
Gabriele Colasuonno
António Guerra
DiffM
60
1
0
11 Aug 2023
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining
Haohe Liu
Yi Yuan
Xubo Liu
Xinhao Mei
Qiuqiang Kong
Qiao Tian
Yuping Wang
Wenwu Wang
Yuxuan Wang
Mark D. Plumbley
DiffM
17
220
0
10 Aug 2023
JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models
Peike Li
Bo-Yu Chen
Yao Yao
Yikai Wang
Allen Wang
Alex Jinpeng Wang
MGen
VLM
DiffM
57
37
0
09 Aug 2023
Generative Approach for Probabilistic Human Mesh Recovery using Diffusion Models
Hanbyel Cho
Junmo Kim
3DH
14
7
0
05 Aug 2023
A Review of Change of Variable Formulas for Generative Modeling
Ullrich Köthe
11
6
0
04 Aug 2023
VisAlign: Dataset for Measuring the Degree of Alignment between AI and Humans in Visual Perception
Jiyoung Lee
Seung Wook Kim
Seunghyun Won
Joonseok Lee
Marzyeh Ghassemi
James Thorne
Jaeseok Choi
O.-Kil Kwon
E. Choi
18
1
0
03 Aug 2023
Reverse Stable Diffusion: What prompt was used to generate this image?
Florinel-Alin Croitoru
Vlad Hondru
Radu Tudor Ionescu
M. Shah
VLM
DiffM
23
5
0
02 Aug 2023
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation
Yasheng Sun
Yifan Yang
Houwen Peng
Yifei Shen
Yuqing Yang
Hang-Rui Hu
Lili Qiu
Hideki Koike
DiffM
LM&Ro
27
33
0
02 Aug 2023
Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models
Cheng-Yu Hsieh
Sibei Chen
Chun-Liang Li
Yasuhisa Fujii
Alexander Ratner
Chen-Yu Lee
Ranjay Krishna
Tomas Pfister
LLMAG
SyDa
29
41
0
01 Aug 2023
Detecting Cloud Presence in Satellite Images Using the RGB-based CLIP Vision-Language Model
Mikolaj Czerkawski
Robert C. Atkinson
Christos Tachtatzis
VLM
10
2
0
01 Aug 2023
MetaDiff: Meta-Learning with Conditional Diffusion for Few-Shot Learning
Baoquan Zhang
Chuyao Luo
Demin Yu
Huiwei Lin
Xutao Li
Yunming Ye
Bowen Zhang
DiffM
25
42
0
31 Jul 2023
MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text
Junchen Zhu
Huan Yang
Wenjing Wang
Huiguo He
Zixi Tuo
...
Wen-Huang Cheng
Lianli Gao
Jingkuan Song
Jianlong Fu
Jiebo Luo
DiffM
23
6
0
31 Jul 2023
UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models
Sen Fang
Bowen Gao
Yangjian Wu
T. Teoh
DiffM
18
1
0
29 Jul 2023
CLIP Brings Better Features to Visual Aesthetics Learners
Liwu Xu
Jinjin Xu
Yuzhe Yang
Yi-Jie Huang
Yanchun Xie
Yaqian Li
VLM
20
3
0
28 Jul 2023
Minimally-Supervised Speech Synthesis with Conditional Diffusion Model and Language Model: A Comparative Study of Semantic Coding
Chunyu Qiang
Hao Li
Hao Ni
He Qu
Ruibo Fu
Tao Wang
Longbiao Wang
J. Dang
DiffM
27
8
0
28 Jul 2023