2112.10752
High-Resolution Image Synthesis with Latent Diffusion Models
20 December 2021
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
Papers citing
"High-Resolution Image Synthesis with Latent Diffusion Models"
50 / 7,956 papers shown
Title
CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading
Mishan Aliev
Dmitry Baranchuk
Kirill Struminsky
DiffM
28
0
0
09 Apr 2025
Latent Diffusion U-Net Representations Contain Positional Embeddings and Anomalies
Jonas Loos
Lorenz Linhardt
26
0
0
09 Apr 2025
PathSegDiff: Pathology Segmentation using Diffusion model representations
Sachin Kumar Danisetty
Alexandros Graikos
Srikar Yellapragada
Dimitris Samaras
MedIm
19
0
0
09 Apr 2025
DyDiT++: Dynamic Diffusion Transformers for Efficient Visual Generation
Wangbo Zhao
Yizeng Han
Jiasheng Tang
Kai Wang
Hao Luo
Yibing Song
Gao Huang
Fan Wang
Yang You
66
0
0
09 Apr 2025
A Meaningful Perturbation Metric for Evaluating Explainability Methods
Danielle Cohen
Hila Chefer
Lior Wolf
AAML
25
0
0
09 Apr 2025
SafeMLRM: Demystifying Safety in Multi-modal Large Reasoning Models
Junfeng Fang
Y. Wang
Ruipeng Wang
Zijun Yao
Kun Wang
An Zhang
X. Wang
Tat-Seng Chua
AAML
LRM
60
3
0
09 Apr 2025
Are We Done with Object-Centric Learning?
Alexander Rubinstein
Ameya Prabhu
Matthias Bethge
Seong Joon Oh
OCL
568
0
0
09 Apr 2025
Distilling Textual Priors from LLM to Efficient Image Fusion
Ran Zhang
Xuanhua He
Ke Cao
L. Liu
Li Zhang
Man Zhou
Jie Zhang
21
0
0
09 Apr 2025
RAGME: Retrieval Augmented Video Generation for Enhanced Motion Realism
E. Peruzzo
Dejia Xu
Xingqian Xu
Humphrey Shi
N. Sebe
DiffM
VGen
54
0
0
09 Apr 2025
Detecting AI-generated Artwork
Meien Li
Mark Stamp
21
0
0
09 Apr 2025
PosterMaker: Towards High-Quality Product Poster Generation with Accurate Text Rendering
Y. Gao
Zihang Lin
Chuanbin Liu
Min Zhou
T. Ge
Bo Zheng
Hongtao Xie
DiffM
35
0
0
09 Apr 2025
HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance
Jiazi Bu
Pengyang Ling
Yujie Zhou
Pan Zhang
Tong Wu
Xiaoyi Dong
Yuhang Zang
Y. Cao
D. Lin
Jiaqi Wang
19
0
0
08 Apr 2025
Transfer between Modalities with MetaQueries
Xichen Pan
Satya Narayan Shukla
Aashu Singh
Zhuokai Zhao
Shlok Kumar Mishra
...
Jiuhai Chen
Kunpeng Li
F. Xu
Ji Hou
Saining Xie
DiffM
41
6
0
08 Apr 2025
Reconstruction-Free Anomaly Detection with Diffusion Models via Direct Latent Likelihood Evaluation
Shunsuke Sakai
Tatsuhito Hasegawa
26
0
0
08 Apr 2025
Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model
Qi Mao
L. Chen
Yuchao Gu
Mike Zheng Shou
Ming-Hsuan Yang
DiffM
39
0
0
08 Apr 2025
Flash Sculptor: Modular 3D Worlds from Objects
Yujia Hu
Songhua Liu
Xingyi Yang
Xinchao Wang
34
0
0
08 Apr 2025
QEMesh: Employing A Quadric Error Metrics-Based Representation for Mesh Generation
Jiaqi Li
Ruowei Wang
Yu Liu
Qijun Zhao
32
0
0
08 Apr 2025
D-Feat Occlusions: Diffusion Features for Robustness to Partial Visual Occlusions in Object Recognition
Rupayan Mallick
Sibo Dong
Nataniel Ruiz
Sarah Adel Bargal
DiffM
44
0
0
08 Apr 2025
Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking
Junxi Chen
Junhao Dong
Xiaohua Xie
33
0
0
08 Apr 2025
econSG: Efficient and Multi-view Consistent Open-Vocabulary 3D Semantic Gaussians
Can Zhang
G. Lee
3DV
50
0
0
08 Apr 2025
CamContextI2V: Context-aware Controllable Video Generation
Luis Denninger
Sina Mokhtarzadeh Azar
Juergen Gall
VGen
33
0
0
08 Apr 2025
OmniSVG: A Unified Scalable Vector Graphics Generation Model
Yiying Yang
Wei Cheng
Sijin Chen
Xianfang Zeng
Jiaxu Zhang
Liao Wang
Gang Yu
Xingjun Ma
Yu Jiang
VLM
40
0
0
08 Apr 2025
Measuring Déjà vu Memorization Efficiently
Narine Kokhlikyan
Bargav Jayaraman
Florian Bordes
Chuan Guo
Kamalika Chaudhuri
23
1
0
08 Apr 2025
CDM-QTA: Quantized Training Acceleration for Efficient LoRA Fine-Tuning of Diffusion Model
Jinming Lu
Minghao She
Wendong Mao
Zhongfeng Wang
MQ
33
0
0
08 Apr 2025
Releasing Differentially Private Event Logs Using Generative Models
Frederik Wangelik
Majid Rafiei
M. Pourbafrani
Wil M.P. van der Aalst
23
0
0
08 Apr 2025
Storybooth: Training-free Multi-Subject Consistency for Improved Visual Storytelling
Jaskirat Singh
Junshen Kevin Chen
Jonas Kohler
Michael Cohen
DiffM
VGen
35
0
0
08 Apr 2025
A Training-Free Style-aligned Image Generation with Scale-wise Autoregressive Model
Jihun Park
Jongmin Gim
Kyoungmin Lee
Minseok Oh
Minwoo Choi
Jaeyeul Kim
Woo Chool Park
Sunghoon Im
DiffM
25
0
0
08 Apr 2025
Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation
Xiaoxing Hu
Ziyang Gong
Y. Wang
Yuru Jia
Gen Luo
Xue Yang
70
0
0
08 Apr 2025
SMF: Template-free and Rig-free Animation Transfer using Kinetic Codes
Sanjeev Muralikrishnan
Niladri Shekhar Dutt
Niloy J. Mitra
21
0
0
07 Apr 2025
Video-Bench: Human-Aligned Video Generation Benchmark
Hui Han
Siyuan Li
Jiaqi Chen
Yiwen Yuan
Yuling Wu
...
Y. Li
J. Zhang
Chi Zhang
Li Li
Yongxin Ni
EGVM
VGen
65
0
0
07 Apr 2025
Studying Image Diffusion Features for Zero-Shot Video Object Segmentation
Thanos Delatolas
Vicky S. Kalogeiton
Dim P. Papadopoulos
DiffM
VOS
43
1
0
07 Apr 2025
AnyArtisticGlyph: Multilingual Controllable Artistic Glyph Generation
Xiongbo Lu
Yaxiong Chen
Shengwu Xiong
DiffM
23
0
0
07 Apr 2025
SCAM: A Real-World Typographic Robustness Evaluation for Multimodal Foundation Models
Justus Westerhoff
Erblina Purellku
Jakob Hackstein
Jonas Loos
Leo Pinetzki
Lorenz Hufe
AAML
28
0
0
07 Apr 2025
Disentangling Instruction Influence in Diffusion Transformers for Parallel Multi-Instruction-Guided Image Editing
Hui Liu
Bin Zou
Suiyun Zhang
Kecheng Chen
Rui Liu
Haoliang Li
DiffM
64
0
0
07 Apr 2025
CADCrafter: Generating Computer-Aided Design Models from Unconstrained Images
Cheng Chen
Jiacheng Wei
Tianrun Chen
Chi Zhang
Xiaofeng Yang
...
Bingchen Yang
Chuan-Sheng Foo
Guosheng Lin
Qixing Huang
Fayao Liu
44
0
0
07 Apr 2025
Enhancing Compositional Reasoning in Vision-Language Models with Synthetic Preference Data
Samarth Mishra
Kate Saenko
Venkatesh Saligrama
CoGe
LRM
37
0
0
07 Apr 2025
Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision
Yuandong Pu
Le Zhuo
Kaiwen Zhu
Liangbin Xie
Wenlong Zhang
Xiangyu Chen
Peng Gao
Yu Qiao
Chao Dong
Yihao Liu
MLLM
61
1
0
07 Apr 2025
Federated Learning for Medical Image Classification: A Comprehensive Benchmark
Zhekai Zhou
Guibo Luo
Mingzhi Chen
Zhenyu Weng
Yuesheng Zhu
FedML
21
0
0
07 Apr 2025
From Specificity to Generality: Revisiting Generalizable Artifacts in Detecting Face Deepfakes
Long Ma
Zhiyuan Yan
Yize Chen
Jin Xu
Qinglang Guo
Hu Huang
Yong Liao
Hui Lin
CVBM
41
0
0
07 Apr 2025
PanoDreamer: Consistent Text to 360-Degree Scene Generation
Zhexiao Xiong
Z. Chen
Zhong Li
Yi Tian Xu
Nathan Jacobs
3DGS
VGen
26
0
0
07 Apr 2025
CREA: A Collaborative Multi-Agent Framework for Creative Content Generation with Diffusion Models
Kavana Venkatesh
Connor Dunlop
Pinar Yanardag
DiffM
33
0
0
07 Apr 2025
PartStickers: Generating Parts of Objects for Rapid Prototyping
Mo Zhou
Josh Myers-Dean
Danna Gurari
21
0
0
07 Apr 2025
TactileNet: Bridging the Accessibility Gap with AI-Generated Tactile Graphics for Individuals with Vision Impairment
Adnan Khan
Alireza Choubineh
Mai A. Shaaban
Abbas Akkasi
Majid Komeili
DiffM
30
0
0
07 Apr 2025
Gaussian Mixture Flow Matching Models
Hansheng Chen
Kai Zhang
Hao Tan
Zexiang Xu
Fujun Luan
Leonidas J. Guibas
Gordon Wetzstein
Sai Bi
DiffM
61
0
0
07 Apr 2025
BrainMRDiff: A Diffusion Model for Anatomically Consistent Brain MRI Synthesis
Moinak Bhattacharya
Saumya Gupta
Annie Singh
C. L. P. Chen
Gagandeep Singh
Prateek Prasanna
MedIm
26
0
0
06 Apr 2025
Your Image Generator Is Your New Private Dataset
Nicolo Resmini
Eugenio Lomurno
Cristian Sbrolli
Matteo Matteucci
26
0
0
06 Apr 2025
Attributed Synthetic Data Generation for Zero-shot Domain-specific Image Classification
Shijian Wang
Linxin Song
Ryotaro Shimizu
M. Goto
Hanqian Wu
VLM
18
0
0
06 Apr 2025
UniToken: Harmonizing Multimodal Understanding and Generation through Unified Visual Encoding
Yang Jiao
Haibo Qiu
Zequn Jie
S. Chen
Jingjing Chen
Lin Ma
Yu Jiang
26
2
0
06 Apr 2025
Multi-identity Human Image Animation with Structural Video Diffusion
Zhenzhi Wang
Y. Li
Yanhong Zeng
Yuwei Guo
D. Lin
Tianfan Xue
Bo Dai
VGen
24
0
0
05 Apr 2025
DiTaiListener: Controllable High Fidelity Listener Video Generation with Diffusion
Maksim Siniukov
Di Chang
Minh Tran
Hongkun Gong
Ashutosh Chaubey
Mohammad Soleymani
DiffM
VGen
23
0
0
05 Apr 2025