Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.10752
Cited By
High-Resolution Image Synthesis with Latent Diffusion Models
20 December 2021
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"High-Resolution Image Synthesis with Latent Diffusion Models"
50 / 8,036 papers shown
Title
Image Clustering Conditioned on Text Criteria
Sehyun Kwon
Jaeseung Park
Minkyu Kim
Jaewoong Cho
Ernest K. Ryu
Kangwook Lee
VLM
34
11
0
27 Oct 2023
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation
Jaemin Cho
Yushi Hu
Roopal Garg
Peter Anderson
Ranjay Krishna
Jason Baldridge
Mohit Bansal
Jordi Pont-Tuset
Su Wang
EGVM
22
66
0
27 Oct 2023
DiffAttack: Evasion Attacks Against Diffusion-Based Adversarial Purification
Mintong Kang
D. Song
Bo-wen Li
33
22
0
27 Oct 2023
Qilin-Med-VL: Towards Chinese Large Vision-Language Model for General Healthcare
Junling Liu
Ziming Wang
Qichen Ye
Dading Chong
Peilin Zhou
Yining Hua
VLM
LM&MA
19
47
0
27 Oct 2023
Reconstructive Latent-Space Neural Radiance Fields for Efficient 3D Scene Representations
Tristan Aumentado-Armstrong
Ashkan Mirzaei
Marcus A. Brubaker
J. Kelly
Alex Levinshtein
Konstantinos G. Derpanis
Igor Gilitschenski
22
4
0
27 Oct 2023
Real-time Animation Generation and Control on Rigged Models via Large Language Models
Han Huang
Fernanda De La Torre
Cathy Mengying Fang
Andrzej Banburski-Fahey
Judith Amores
Jaron Lanier
AI4CE
27
8
0
27 Oct 2023
Large-scale Foundation Models and Generative AI for BigData Neuroscience
Ran Wang
Zhe Sage Chen
MedIm
AI4CE
LRM
16
8
0
27 Oct 2023
A Wireless AI-Generated Content (AIGC) Provisioning Framework Empowered by Semantic Communication
Runze Cheng
Yao Sun
Dusit Niyato
Lan Zhang
Lei Zhang
Muhammad Ali Imran
15
11
0
26 Oct 2023
6-DoF Stability Field via Diffusion Models
Takuma Yoneda
Tianchong Jiang
Gregory Shakhnarovich
Matthew R. Walter
22
2
0
26 Oct 2023
torchdistill Meets Hugging Face Libraries for Reproducible, Coding-Free Deep Learning Studies: A Case Study on NLP
Yoshitomo Matsubara
VLM
8
1
0
26 Oct 2023
Noise-Free Score Distillation
Oren Katzir
Or Patashnik
Daniel Cohen-Or
Dani Lischinski
DiffM
13
70
0
26 Oct 2023
Global Structure-Aware Diffusion Process for Low-Light Image Enhancement
Jinhui Hou
Zhiyu Zhu
Junhui Hou
Hui Liu
Huanqiang Zeng
Hui Yuan
40
75
0
26 Oct 2023
DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation
Yongxin Zhu
Zhujin Gao
Xinyuan Zhou
Zhongyi Ye
Linli Xu
26
2
0
26 Oct 2023
SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching
Xinghui Li
Jingyi Lu
Kai Han
V. Prisacariu
DiffM
30
19
0
26 Oct 2023
AntifakePrompt: Prompt-Tuned Vision-Language Models are Fake Image Detectors
You-Ming Chang
Chen Yeh
Wei-Chen Chiu
Ning Yu
VPVLM
VLM
64
21
0
26 Oct 2023
Exploring the Potential of Generative AI for the World Wide Web
Nouar Aldahoul
Joseph Hong
Matteo Varvello
Yasir Zaki
6
6
0
26 Oct 2023
CADS: Unleashing the Diversity of Diffusion Models through Condition-Annealed Sampling
Seyedmorteza Sadat
Jakob Buhmann
Derek Bradley
Otmar Hilliges
Romann M. Weber
DiffM
23
38
0
26 Oct 2023
Semantic Generative Augmentations for Few-Shot Counting
Perla Doubinsky
Nicolas Audebert
M. Crucianu
Hervé Le Borgne
VLM
DiffM
19
4
0
26 Oct 2023
Defect Spectrum: A Granular Look of Large-Scale Defect Datasets with Rich Semantics
Shuai Yang
Zhifei Chen
Pengguang Chen
Xi Fang
Yixun Liang
Shu Liu
Ying-Cong Chen
16
10
0
26 Oct 2023
Attribute Based Interpretable Evaluation Metrics for Generative Models
Dongkyun Kim
Mingi Kwon
Youngjung Uh
EGVM
12
2
0
26 Oct 2023
Exploring Iterative Refinement with Diffusion Models for Video Grounding
Xiao Liang
Tao Shi
Yaoyuan Liang
Te Tao
Shao-Lun Huang
DiffM
27
1
0
26 Oct 2023
Improving Denoising Diffusion Models via Simultaneous Estimation of Image and Noise
Zhenkai Zhang
Krista A. Ehinger
Tom Drummond
DiffM
36
0
0
26 Oct 2023
Unleashing the potential of GNNs via Bi-directional Knowledge Transfer
Shuai Zheng
Zhizhe Liu
Zhenfeng Zhu
Xingxing Zhang
Jianxin Li
Yao-Min Zhao
25
0
0
26 Oct 2023
HyperFields: Towards Zero-Shot Generation of NeRFs from Text
Sudarshan Babu
Richard Liu
Avery Zhou
Michael Maire
Greg Shakhnarovich
Rana Hanocka
AI4CE
19
10
0
26 Oct 2023
PERF: Panoramic Neural Radiance Field from a Single Panorama
Guangcong Wang
Peng Wang
Zhaoxi Chen
Wenping Wang
Chen Change Loy
Ziwei Liu
MDE
18
31
0
25 Oct 2023
CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images
Aaron Gokaslan
A. Feder Cooper
Jasmine Collins
Landan Seguin
Austin Jacobson
Mihir Patel
Jonathan Frankle
Cory Stephenson
Volodymyr Kuleshov
DiffM
17
16
0
25 Oct 2023
DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Jingxiang Sun
Bo Zhang
Ruizhi Shao
Lizhen Wang
Wen Liu
Zhenda Xie
Yebin Liu
23
132
0
25 Oct 2023
Kiki or Bouba? Sound Symbolism in Vision-and-Language Models
Morris Alper
Hadar Averbuch-Elor
33
10
0
25 Oct 2023
Multi-scale Diffusion Denoised Smoothing
Jongheon Jeong
Jinwoo Shin
DiffM
16
8
0
25 Oct 2023
MultiPrompter: Cooperative Prompt Optimization with Multi-Agent Reinforcement Learning
Dong-Ki Kim
Sungryull Sohn
Lajanugen Logeswaran
Dongsub Shim
Honglak Lee
LLMAG
28
1
0
25 Oct 2023
A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image Generation
Eyal Segalis
Dani Valevski
Danny Lumen
Yossi Matias
Yaniv Leviathan
DiffM
42
22
0
25 Oct 2023
Free-form Flows: Make Any Architecture a Normalizing Flow
Felix Dräxler
Peter Sorrenson
Lea Zimmermann
Armand Rousselot
Ullrich Kothe
TPM
DRL
AI4CE
BDL
24
8
0
25 Oct 2023
Adapt Anything: Tailor Any Image Classifiers across Domains And Categories Using Text-to-Image Diffusion Models
Weijie Chen
Haoyu Wang
Shicai Yang
Lei Zhang
Wei Wei
Yanning Zhang
Luojun Lin
Di Xie
Yueting Zhuang
DiffM
VLM
OOD
23
0
0
25 Oct 2023
Fuse Your Latents: Video Editing with Multi-source Latent Diffusion Models
Tianyi Lu
Xing Zhang
Jiaxi Gu
Hang Xu
Renjing Pei
Songcen Xu
Zuxuan Wu
DiffM
VGen
25
4
0
25 Oct 2023
Dolfin: Diffusion Layout Transformers without Autoencoder
Yilin Wang
Zeyuan Chen
Liangjun Zhong
Zheng Ding
Zhizhou Sha
Zhuowen Tu
35
16
0
25 Oct 2023
Removing Dust from CMB Observations with Diffusion Models
David Heurtel-Depeiges
B. Burkhart
Ruben Ohana
Bruno Régaldo-Saint Blancard
DiffM
11
1
0
25 Oct 2023
UAV-Sim: NeRF-based Synthetic Data Generation for UAV-based Perception
Christopher Maxey
Jaehoon Choi
Hyungtae Lee
Dinesh Manocha
Heesung Kwon
19
8
0
25 Oct 2023
On the Proactive Generation of Unsafe Images From Text-To-Image Models Using Benign Prompts
Yixin Wu
Ning Yu
Michael Backes
Yun Shen
Yang Zhang
DiffM
51
8
0
25 Oct 2023
Local Statistics for Generative Image Detection
Yung Jer Wong
Teck Khim Ng
DiffM
26
2
0
25 Oct 2023
TiC-CLIP: Continual Training of CLIP Models
Saurabh Garg
Mehrdad Farajtabar
Hadi Pouransari
Raviteja Vemulapalli
Sachin Mehta
Oncel Tuzel
Vaishaal Shankar
Fartash Faghri
VLM
CLIP
31
26
0
24 Oct 2023
iNVS: Repurposing Diffusion Inpainters for Novel View Synthesis
Yash Kant
Aliaksandr Siarohin
Michael Vasilkovsky
R. A. Guler
Jian Ren
Sergey Tulyakov
Igor Gilitschenski
DiffM
19
12
0
24 Oct 2023
Yin Yang Convolutional Nets: Image Manifold Extraction by the Analysis of Opposites
Augusto Seben da Rosa
F. S. Oliveira
A. S. Soares
Arnaldo Cândido Júnior
AAML
24
0
0
24 Oct 2023
Integrating View Conditions for Image Synthesis
Jinbin Bai
Zhen Dong
Aosong Feng
Xiao Zhang
Tian-Chun Ye
Kaicheng Zhou
59
12
0
24 Oct 2023
Improving Robustness and Reliability in Medical Image Classification with Latent-Guided Diffusion and Nested-Ensembles
Xing Shen
Hengguan Huang
Brennan Nichyporuk
Tal Arbel
MedIm
38
4
0
24 Oct 2023
RePoseDM: Recurrent Pose Alignment and Gradient Guidance for Pose Guided Image Synthesis
Anant Khandelwal
DiffM
11
0
0
24 Oct 2023
Recent Advances in Multi-modal 3D Scene Understanding: A Comprehensive Survey and Evaluation
Yinjie Lei
Zixuan Wang
Feng Chen
Guoqing Wang
Peng Wang
Yang Yang
29
8
0
24 Oct 2023
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding
Haoxiang Wang
Pavan Kumar Anasosalu Vasu
Fartash Faghri
Raviteja Vemulapalli
Mehrdad Farajtabar
Sachin Mehta
Mohammad Rastegari
Oncel Tuzel
Hadi Pouransari
VLM
20
67
0
23 Oct 2023
DeTiME: Diffusion-Enhanced Topic Modeling using Encoder-decoder based LLM
Weijie Xu
Wenxiang Hu
Fanyou Wu
Srinivasan H. Sengamedu
DiffM
21
13
0
23 Oct 2023
FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling
Haonan Qiu
Menghan Xia
Yong Zhang
Yin-Yin He
Xintao Wang
Ying Shan
Ziwei Liu
DiffM
VGen
17
88
0
23 Oct 2023
FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models
Lihe Yang
Xiaogang Xu
Bingyi Kang
Yinghuan Shi
Hengshuang Zhao
19
45
0
23 Oct 2023
Previous
1
2
3
...
144
145
146
...
159
160
161
Next