Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.10752
Cited By
High-Resolution Image Synthesis with Latent Diffusion Models
20 December 2021
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"High-Resolution Image Synthesis with Latent Diffusion Models"
50 / 8,007 papers shown
Title
Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale
Qichao Wang
Tian Bian
Yian Yin
Tingyang Xu
Hong Cheng
Helen M. Meng
Zibin Zheng
Liang Chen
Bingzhe Wu
VLM
DiffM
28
3
0
18 Oct 2023
GenEval: An Object-Focused Framework for Evaluating Text-to-Image Alignment
Dhruba Ghosh
Hanna Hajishirzi
Ludwig Schmidt
9
134
0
17 Oct 2023
EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
Yaofang Liu
Xiaodong Cun
Xuebo Liu
Xintao Wang
Yong Zhang
Haoxin Chen
Yang Liu
Tieyong Zeng
Raymond H. F. Chan
Ying Shan
VGen
EGVM
11
127
0
17 Oct 2023
Elucidating The Design Space of Classifier-Guided Diffusion Generation
Jiajun Ma
Tianyang Hu
Wenjia Wang
Jiacheng Sun
25
9
0
17 Oct 2023
Leveraging Diverse Semantic-based Audio Pretrained Models for Singing Voice Conversion
Xueyao Zhang
Yicheng Gu
Haopeng Chen
Zihao Fang
Lexiao Zou
Junan Zhang
Liumeng Xue
Jinchao Zhang
Jie Zhou
Zhizheng Wu
DiffM
24
1
0
17 Oct 2023
BayesDiff: Estimating Pixel-wise Uncertainty in Diffusion via Bayesian Inference
Siqi Kou
Lei Gan
Dequan Wang
Chongxuan Li
Zhijie Deng
BDL
DiffM
21
7
0
17 Oct 2023
Towards Training-free Open-world Segmentation via Image Prompt Foundation Models
Lv Tang
Peng-Tao Jiang
Haoke Xiao
Bo Li
VLM
8
7
0
17 Oct 2023
LAMP: Learn A Motion Pattern for Few-Shot-Based Video Generation
Ruiqi Wu
Liangyu Chen
Tong Yang
Chunle Guo
Chongyi Li
Xiangyu Zhang
DiffM
VGen
86
52
0
16 Oct 2023
BiomedJourney: Counterfactual Biomedical Image Generation by Instruction-Learning from Multimodal Patient Journeys
Yu Gu
Jianwei Yang
Naoto Usuyama
Chun-yue Li
Sheng Zhang
M. Lungren
Jianfeng Gao
Hoifung Poon
MedIm
22
22
0
16 Oct 2023
A Survey on Video Diffusion Models
Zhen Xing
Qijun Feng
Haoran Chen
Qi Dai
Hang-Rui Hu
Hang Xu
Zuxuan Wu
Yu-Gang Jiang
EGVM
VGen
55
115
0
16 Oct 2023
TOSS:High-quality Text-guided Novel View Synthesis from a Single Image
Yukai Shi
Jianan Wang
He Cao
Boshi Tang
Xianbiao Qi
Tianyu Yang
Yukun Huang
Shilong Liu
Lei Zhang
H. Shum
DiffM
14
20
0
16 Oct 2023
LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts
Hanan Gani
Shariq Farooq Bhat
Muzammal Naseer
Salman Khan
Peter Wonka
DiffM
42
38
0
16 Oct 2023
DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing
Jia-Wei Liu
Yan-Pei Cao
Jay Zhangjie Wu
Weijia Mao
Yuchao Gu
Rui Zhao
Jussi Keppo
Ying Shan
Mike Zheng Shou
VGen
DiffM
30
14
0
16 Oct 2023
ForceGen: End-to-end de novo protein generation based on nonlinear mechanical unfolding responses using a protein language diffusion model
Bo Ni
David L. Kaplan
Markus J. Buehler
DiffM
26
5
0
16 Oct 2023
Generation or Replication: Auscultating Audio Latent Diffusion Models
Dimitrios Bralios
G. Wichern
François G. Germain
Zexu Pan
Sameer Khurana
Chiori Hori
Jonathan Le Roux
DiffM
19
6
0
16 Oct 2023
Interpreting and Controlling Vision Foundation Models via Text Explanations
Haozhe Chen
Junfeng Yang
Carl Vondrick
Chengzhi Mao
11
2
0
16 Oct 2023
ViPE: Visualise Pretty-much Everything
Hassan Shahmohammadi
Adhiraj Ghosh
Hendrik P. A. Lensch
DiffM
23
1
0
16 Oct 2023
Real-Fake: Effective Training Data Synthesis Through Distribution Matching
Jianhao Yuan
Jie Zhang
Shuyang Sun
Philip H. S. Torr
Bo-Lu Zhao
23
22
0
16 Oct 2023
ConsistNet: Enforcing 3D Consistency for Multi-view Images Diffusion
Jiayu Yang
Ziang Cheng
Yunfei Duan
Pan Ji
Hongdong Li
DiffM
39
53
0
16 Oct 2023
Scene Graph Conditioning in Latent Diffusion
Frank Fundel
DiffM
27
0
0
16 Oct 2023
Towards image compression with perfect realism at ultra-low bitrates
Marlene Careil
Matthew Muckley
Jakob Verbeek
Stéphane Lathuilière
DiffM
24
44
0
16 Oct 2023
Self-supervised Fetal MRI 3D Reconstruction Based on Radiation Diffusion Generation Model
Junpeng Tan
Xin Zhang
Yao Lv
Xiangmin Xu
Gang Li
DiffM
MedIm
25
0
0
16 Oct 2023
MoConVQ: Unified Physics-Based Motion Control via Scalable Discrete Representations
Heyuan Yao
Zhenhua Song
Yuyang Zhou
Tenglong Ao
Baoquan Chen
Libin Liu
11
38
0
16 Oct 2023
AutoDIR: Automatic All-in-One Image Restoration with Latent Diffusion
Yitong Jiang
Zhaoyang Zhang
Tianfan Xue
Jinwei Gu
DiffM
32
43
0
16 Oct 2023
Ring-A-Bell! How Reliable are Concept Removal Methods for Diffusion Models?
Yu-Lin Tsai
Chia-Yi Hsu
Chulin Xie
Chih-Hsun Lin
Jia-You Chen
Bo-wen Li
Pin-Yu Chen
Chia-Mu Yu
Chun-ying Huang
DiffM
28
76
0
16 Oct 2023
Prompting for Discovery: Flexible Sense-Making for AI Art-Making with Dreamsheets
Shm Garanganao
J.D. Zamfirescu-Pereira
Kyu Won Kim
Mani Rathnam
Bjoern Hartmann
32
29
0
15 Oct 2023
Farzi Data: Autoregressive Data Distillation
Noveen Sachdeva
Zexue He
Wang-Cheng Kang
Jianmo Ni
D. Cheng
Julian McAuley
DD
17
3
0
15 Oct 2023
ProteusNeRF: Fast Lightweight NeRF Editing using 3D-Aware Image Context
Binglun Wang
Niladri Shekhar Dutt
Niloy J. Mitra
38
10
0
15 Oct 2023
Unsupervised Discovery of Interpretable Directions in h-space of Pre-trained Diffusion Models
Zijian Zhang
Luping Liu
Zhijie Lin
Yichen Zhu
Zhou Zhao
DiffM
29
4
0
15 Oct 2023
LOVECon: Text-driven Training-Free Long Video Editing with ControlNet
Zhenyi Liao
Zhijie Deng
DiffM
19
7
0
15 Oct 2023
Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space
Hengrui Zhang
Jiani Zhang
Balasubramaniam Srinivasan
Zhengyuan Shen
Xiao Qin
Christos Faloutsos
Huzefa Rangwala
George Karypis
DiffM
27
80
0
14 Oct 2023
Towards More Accurate Diffusion Model Acceleration with A Timestep Aligner
Mengfei Xia
Yujun Shen
Changsong Lei
Yu Zhou
Ran Yi
Deli Zhao
Wenping Wang
Yong-jin Liu
16
5
0
14 Oct 2023
PaintHuman: Towards High-fidelity Text-to-3D Human Texturing via Denoised Score Distillation
Jianhui Yu
Hao Zhu
Liming Jiang
Chen Change Loy
Weidong Cai
Wayne Wu
DiffM
26
2
0
14 Oct 2023
Integrating Symbolic Reasoning into Neural Generative Models for Design Generation
Maxwell J. Jacobson
Yexiang Xue
NAI
30
0
0
13 Oct 2023
Vision-by-Language for Training-Free Compositional Image Retrieval
Shyamgopal Karthik
Karsten Roth
Massimiliano Mancini
Zeynep Akata
CoGe
26
52
0
13 Oct 2023
Hypernymy Understanding Evaluation of Text-to-Image Models via WordNet Hierarchy
Anton Baryshnikov
Max Ryabinin
VLM
14
2
0
13 Oct 2023
Discovery and Expansion of New Domains within Diffusion Models
Ye Zhu
Yu Wu
Duo Xu
Zhiwei Deng
Yan Yan
Olga Russakovsky
DiffM
26
1
0
13 Oct 2023
CopyScope: Model-level Copyright Infringement Quantification in the Diffusion Workflow
Junlei Zhou
Jiashi Gao
Ziwei Wang
Xuetao Wei
18
2
0
13 Oct 2023
MINDE: Mutual Information Neural Diffusion Estimation
Giulio Franzese
Mustapha Bounoua
Pietro Michiardi
DiffM
19
7
0
13 Oct 2023
EasyGen: Easing Multimodal Generation with BiDiffuser and LLMs
Xiangyu Zhao
Bo Liu
Qijiong Liu
Guangyuan Shi
Xiao-Ming Wu
VLM
DiffM
21
7
0
13 Oct 2023
Extending Multi-modal Contrastive Representations
Zehan Wang
Ziang Zhang
Luping Liu
Yang Zhao
Haifeng Huang
Tao Jin
Zhou Zhao
19
5
0
13 Oct 2023
R&B: Region and Boundary Aware Zero-shot Grounded Text-to-image Generation
Jiayu Xiao
Henglei Lv
Liang Li
Shuhui Wang
Qingming Huang
DiffM
24
20
0
13 Oct 2023
DDMT: Denoising Diffusion Mask Transformer Models for Multivariate Time Series Anomaly Detection
Chaocheng Yang
Tingyin Wang
Xuanhui Yan
DiffM
24
7
0
13 Oct 2023
CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
Sreyan Ghosh
Ashish Seth
Sonal Kumar
Utkarsh Tyagi
Chandra Kiran Reddy Evuru
S. Ramaneswaran
S. Sakshi
Oriol Nieto
R. Duraiswami
Dinesh Manocha
AuLLM
VLM
CoGe
35
21
0
12 Oct 2023
Histogram- and Diffusion-Based Medical Out-of-Distribution Detection
Evi M. C. Huijben
S. Amirrajab
J. Pluim
MedIm
DiffM
34
1
0
12 Oct 2023
OmniControl: Control Any Joint at Any Time for Human Motion Generation
Yiming Xie
Varun Jampani
Lei Zhong
Deqing Sun
Huaizu Jiang
DiffM
24
108
0
12 Oct 2023
HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion
Xian Liu
Jian Ren
Aliaksandr Siarohin
Ivan Skorokhodov
Yanyu Li
Dahua Lin
Xihui Liu
Ziwei Liu
Sergey Tulyakov
32
57
0
12 Oct 2023
Jigsaw: Supporting Designers to Prototype Multimodal Applications by Chaining AI Foundation Models
David Chuan-En Lin
Nikolas Martelaro
24
18
0
12 Oct 2023
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Zhengyuan Yang
Jianfeng Wang
Linjie Li
Kevin Qinghong Lin
Chung-Ching Lin
Zicheng Liu
Lijuan Wang
LRM
MLLM
DiffM
13
22
0
12 Oct 2023
Animating Street View
Mengyi Shan
Brian L. Curless
Ira Kemelmacher-Shlizerman
Steven M. Seitz
33
2
0
12 Oct 2023
Previous
1
2
3
...
146
147
148
...
159
160
161
Next