2112.10752
Cited By
High-Resolution Image Synthesis with Latent Diffusion Models
20 December 2021
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
Papers citing
"High-Resolution Image Synthesis with Latent Diffusion Models"
50 / 7,852 papers shown
Title
Taming Consistency Distillation for Accelerated Human Image Animation
X. Wang
Shiwei Zhang
Hangjie Yuan
Yujie Wei
Y. Zhang
Changxin Gao
Yuehuan Wang
Nong Sang
VGen
22
0
0
15 Apr 2025
OmniVDiff: Omni Controllable Video Diffusion for Generation and Understanding
Dianbing Xi
J. Wang
Yuanzhi Liang
Xi Qiu
Yuchi Huo
R. Wang
Chi Zhang
X. Li
DiffM
VGen
62
0
0
15 Apr 2025
ADT: Tuning Diffusion Models with Adversarial Supervision
Dazhong Shen
Guanglu Song
Y. Zhang
Bingqi Ma
Lujundong Li
D. Jiang
Zhuofan Zong
Y. Liu
DiffM
40
0
0
15 Apr 2025
DeepWheel: Generating a 3D Synthetic Wheel Dataset for Design and Performance Evaluation
Soyoung Yoo
Namwoo Kang
30
0
0
15 Apr 2025
InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation
Yukang Lin
Y. Hong
Zunnan Xu
X. Li
Chao Xu
...
Jun Lan
Huijia Zhu
Weiqiang Wang
Jianfu Zhang
Xiu Li
VGen
46
0
0
15 Apr 2025
GaSLight: Gaussian Splats for Spatially-Varying Lighting in HDR
Christophe Bolduc
Yannick Hold-Geoffroy
Zhixin Shu
Jean-François Lalonde
3DGS
29
0
0
15 Apr 2025
Explicit and Implicit Representations in AI-based 3D Reconstruction for Radiology: A systematic literature review
Yuezhe Yang
Boyu Yang
Yaqian Wang
Yang He
Xingbo Dong
Zhe Jin
38
0
0
15 Apr 2025
SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL
Junke Wang
Zhi Tian
X. Wang
Xinyu Zhang
Weilin Huang
Zuxuan Wu
Yu Jiang
VGen
43
3
0
15 Apr 2025
Prototype-Guided Diffusion for Digital Pathology: Achieving Foundation Model Performance with Minimal Clinical Data
Ekaterina Redekop
Mara Pleasure
Vedrana Ivezić
Zichen Wang
Kimberly Flores
Anthony Sisk
W. Speier
C. Arnold
MedIm
33
0
0
15 Apr 2025
NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors
Yanrui Bin
Wenbo Hu
Haoyuan Wang
Xinya Chen
Bing Wang
DiffM
45
0
0
15 Apr 2025
Hierarchical and Step-Layer-Wise Tuning of Attention Specialty for Multi-Instance Synthesis in Diffusion Transformers
Chunyang Zhang
Zhenhong Sun
Zhicheng Zhang
Junyan Wang
Yu Zhang
Dong Gong
H. Mo
Daoyi Dong
33
0
0
14 Apr 2025
MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model
Jian Liu
Wei Sun
Hui Yang
Jin Zheng
Zichen Geng
Hossein Rahmani
Ajmal Saeed Mian
DiffM
33
0
0
14 Apr 2025
Analysis of Attention in Video Diffusion Transformers
Yuxin Wen
Jim Wu
Ajay Jain
Tom Goldstein
Ashwinee Panda
35
1
0
14 Apr 2025
Efficient Generative Model Training via Embedded Representation Warmup
Deyuan Liu
Peng Sun
Xufeng Li
Tao Lin
19
0
0
14 Apr 2025
Efficient Task-specific Conditional Diffusion Policies: Shortcut Model Acceleration and SO(3) Optimization
Haiyong Yu
Yanqiong Jin
Yonghao He
Wei Sui
22
0
0
14 Apr 2025
Prior Does Matter: Visual Navigation via Denoising Diffusion Bridge Models
Hao Ren
Yiming Zeng
Zetong Bi
Zhaoliang Wan
Junlong Huang
Hui Cheng
54
1
0
14 Apr 2025
Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing
Taihang Hu
Linxuan Li
Kai Wang
Yaxing Wang
Jian Yang
Ming-Ming Cheng
DiffM
VGen
23
0
0
14 Apr 2025
REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers
Xingjian Leng
Jaskirat Singh
Yunzhong Hou
Zhenchang Xing
Saining Xie
Liang Zheng
34
0
0
14 Apr 2025
H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models
Yushu Wu
Yanyu Li
Ivan Skorokhodov
Anil Kag
Willi Menapace
Sharath Girish
Aliaksandr Siarohin
Yanzhi Wang
Sergey Tulyakov
DiffM
VGen
35
0
0
14 Apr 2025
Enhanced Semantic Extraction and Guidance for UGC Image Super Resolution
Yiwen Wang
Ying Liang
Yuxuan Zhang
Xinning Chai
Zhengxue Cheng
Yingsheng Qin
Yucai Yang
Rong Xie
Li-Na Song
24
1
0
14 Apr 2025
SpinMeRound: Consistent Multi-View Identity Generation Using Diffusion Models
Stathis Galanakis
Alexandros Lattas
Stylianos Moschoglou
Bernhard Kainz
S. Zafeiriou
DiffM
33
0
0
14 Apr 2025
InstructEngine: Instruction-driven Text-to-Image Alignment
Xingyu Lu
Y. Hu
Y. Zhang
Kaiyu Jiang
Changyi Liu
...
Bin Wen
C. Yuan
Fan Yang
Tingting Gao
Di Zhang
34
0
0
14 Apr 2025
DiffMOD: Progressive Diffusion Point Denoising for Moving Object Detection in Remote Sensing
Jinyue Zhang
Xiangrong Zhang
Zhongjian Huang
Tianyang Zhang
Yifei Jiang
Licheng Jiao
DiffM
19
0
0
14 Apr 2025
Computer-Aided Layout Generation for Building Design: A Review
Jiachen Liu
Yuan Xue
Haomiao Ni
Rui Yu
Zihan Zhou
S. X. Huang
3DV
AI4CE
72
0
0
13 Apr 2025
Early-Bird Diffusion: Investigating and Leveraging Timestep-Aware Early-Bird Tickets in Diffusion Models for Efficient Training
Lexington Whalen
Zhenbang Du
Haoran You
Chaojian Li
Sixu Li
Yingyan
31
0
0
13 Apr 2025
Scalable Motion In-betweening via Diffusion and Physics-Based Character Adaptation
Jia Qin
DiffM
VGen
36
0
0
13 Apr 2025
CamMimic: Zero-Shot Image To Camera Motion Personalized Video Generation Using Diffusion Models
P. Guhan
D. Kothandaraman
Tsung-Wei Huang
Guan-Ming Su
Dinesh Manocha
DiffM
VGen
34
0
0
13 Apr 2025
SD-ReID: View-aware Stable Diffusion for Aerial-Ground Person Re-Identification
Xiang Hu
Pingping Zhang
Yuhao Wang
Bin Yan
Huchuan Lu
23
0
0
13 Apr 2025
D²iT: Dynamic Diffusion Transformer for Accurate Image Generation
Weinan Jia
Mengqi Huang
Nan Chen
Lei Zhang
Zhendong Mao
21
0
0
13 Apr 2025
DiTSE: High-Fidelity Generative Speech Enhancement via Latent Diffusion Transformers
Heitor R. Guimarães
Jiaqi Su
Rithesh Kumar
Tiago H. Falk
Zeyu Jin
DiffM
30
2
0
13 Apr 2025
Probability Distribution Alignment and Low-Rank Weight Decomposition for Source-Free Domain Adaptive Brain Decoding
Ganxi Xu
Jinyi Long
Hanrui Wu
24
0
0
12 Apr 2025
MedIL: Implicit Latent Spaces for Generating Heterogeneous Medical Images at Arbitrary Resolutions
Tyler A. Spears
Shen Zhu
Yinzhu Jin
A. Shrivastava
P. T. Fletcher
LM&MA
MedIm
45
0
0
12 Apr 2025
Generation of Musical Timbres using a Text-Guided Diffusion Model
Weixuan Yuan
Qadeer Khan
Vladimir Golkov
DiffM
24
0
0
12 Apr 2025
Flux Already Knows -- Activating Subject-Driven Image Generation without Training
Hao Kang
Stathi Fotiadis
Liming Jiang
Qing Yan
Yumin Jia
Zichuan Liu
Min Jin Chong
Xin Lu
35
0
0
12 Apr 2025
Sculpting Memory: Multi-Concept Forgetting in Diffusion Models via Dynamic Mask and Concept-Aware Optimization
Gen Li
Yang Xiao
Jie Ji
Kaiyuan Deng
Bo Hui
Linke Guo
Xiaolong Ma
24
0
0
12 Apr 2025
CoProSketch: Controllable and Progressive Sketch Generation with Diffusion Model
Ruohao Zhan
Yijin Li
Yisheng He
Shuo Chen
Yichen Shen
Xinyu Chen
Zilong Dong
Zhaoyang Huang
Guofeng Zhang
DiffM
27
0
0
11 Apr 2025
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation
Tianwei Xiong
Jun Hao Liew
Zilong Huang
Jiashi Feng
Xihui Liu
29
0
0
11 Apr 2025
DreamFuse: Adaptive Image Fusion with Diffusion Transformer
Junjia Huang
Pengxiang Yan
Jiyang Liu
Jie Wu
Zhao Wang
Yitong Wang
Liang Lin
G. Li
35
0
0
11 Apr 2025
Diffusion Models for Robotic Manipulation: A Survey
Rosa Wolf
Yitian Shi
Sheng Liu
Rania Rayyes
51
1
0
11 Apr 2025
LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping
Pascal Chang
Sergio Sancho
Jingwei Tang
Markus Gross
Vinicius Azevedo
28
0
0
11 Apr 2025
Geometric Consistency Refinement for Single Image Novel View Synthesis via Test-Time Adaptation of Diffusion Models
Josef Bengtson
David Nilsson
Fredrik Kahl
DiffM
37
0
0
11 Apr 2025
MotionDreamer: One-to-Many Motion Synthesis with Localized Generative Masked Transformer
Yilin Wang
Chuan Guo
Yuxuan Mu
Muhammad Gohar Javed
X. Zuo
Juwei Lu
Hai Jiang
Li Cheng
VGen
30
0
0
11 Apr 2025
COP-GEN-Beta: Unified Generative Modelling of COPernicus Imagery Thumbnails
Miguel Espinosa
V. Marsocci
Yuru Jia
Elliot J. Crowley
Mikolaj Czerkawski
DiffM
47
0
0
11 Apr 2025
Discriminator-Free Direct Preference Optimization for Video Diffusion
Haoran Cheng
Qide Dong
Liang Peng
Zhizhou Sha
Weiguo Feng
Jinghui Xie
Zhao Song
Shilei Wen
Xiaofei He
Boxi Wu
VGen
46
0
0
11 Apr 2025
On Background Bias of Post-Hoc Concept Embeddings in Computer Vision DNNs
Gesina Schwalbe
Georgii Mikriukov
Edgar Heinert
Stavros Gerolymatos
Mert Keser
Alois Knoll
Matthias Rottmann
Annika Mütze
29
0
0
11 Apr 2025
Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model
Team Seawead
Ceyuan Yang
Zhijie Lin
Yang Zhao
Shanchuan Lin
...
Zuquan Song
Zhenheng Yang
Jiashi Feng
Jianchao Yang
Lu Jiang
DiffM
79
1
0
11 Apr 2025
Muon-Accelerated Attention Distillation for Real-Time Edge Synthesis via Optimized Latent Diffusion
Weiye Chen
Qingen Zhu
Qian Long
24
0
0
11 Apr 2025
Palmprint De-Identification Using Diffusion Model for High-Quality and Diverse Synthesis
Licheng Yan
Bob Zhang
Andrew Beng Jin Teoh
L. Leng
Shuyi Li
Yuqi Wang
Ziyuan Yang
28
0
0
11 Apr 2025
ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration
Yongsheng Yu
Haitian Zheng
Zhifei Zhang
Jianming Zhang
Yuqian Zhou
Connelly Barnes
Y. Liu
Wei Xiong
Zhe Lin
Jiebo Luo
44
0
0
11 Apr 2025
TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation
Ruineng Li
Daitao Xing
Huiming Sun
Yuanzhou Ha
Jinglin Shen
C. Ho
DiffM
VGen
37
0
0
11 Apr 2025