Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2112.10752
Cited By
High-Resolution Image Synthesis with Latent Diffusion Models
20 December 2021
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"High-Resolution Image Synthesis with Latent Diffusion Models"
50 / 8,014 papers shown
Title
DenseFormer: Learning Dense Depth Map from Sparse Depth and Image via Conditional Diffusion Model
Ming Yuan
Sichao Wang
Chuang Zhang
Lei He
Qing Xu
Jianqiang Wang
DiffM
MDE
47
0
0
31 Mar 2025
ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image
Tianyi Gong
Boyan Li
Yifei Zhong
Fangxin Wang
3DGS
VGen
42
0
0
31 Mar 2025
ERUPT: Efficient Rendering with Unposed Patch Transformer
Maxim V. Shugaev
Vincent Chen
Maxim Karrenbach
Kyle Ashley
Bridget Kennedy
Naresh P. Cuntoor
32
0
0
31 Mar 2025
THEMIS: Towards Practical Intellectual Property Protection for Post-Deployment On-Device Deep Learning Models
Yujin Huang
Zhi Zhang
Qingchuan Zhao
Xingliang Yuan
Chunyang Chen
37
0
0
31 Mar 2025
MuseFace: Text-driven Face Editing via Diffusion-based Mask Generation Approach
Xin Zhang
Siting Huang
Xiangyang Luo
Yifan Xie
Weijiang Yu
Heng Chang
Fei Ma
Fei Richard Yu
DiffM
38
0
0
31 Mar 2025
Consistent Subject Generation via Contrastive Instantiated Concepts
Lee Hsin-Ying
Kelvin Chan
Ming Yang
DiffM
95
0
0
31 Mar 2025
Free360: Layered Gaussian Splatting for Unbounded 360-Degree View Synthesis from Extremely Sparse and Unposed Views
Chong Bao
Xiyu Zhang
Zehao Yu
Jiale Shi
Guofeng Zhang
Songyou Peng
Zhaopeng Cui
3DGS
3DV
36
0
0
31 Mar 2025
Training-Free Text-Guided Image Editing with Visual Autoregressive Model
Yufei Wang
Lanqing Guo
Z. Li
Jiaxing Huang
Pichao Wang
Bihan Wen
J. Wang
DiffM
60
1
0
31 Mar 2025
DiET-GS: Diffusion Prior and Event Stream-Assisted Motion Deblurring 3D Gaussian Splatting
Seungjun Lee
Gim Hee Lee
3DGS
DiffM
46
0
0
31 Mar 2025
Biologically Inspired Spiking Diffusion Model with Adaptive Lateral Selection Mechanism
Linghao Feng
Dongcheng Zhao
Sicheng Shen
Yi Zeng
67
0
0
31 Mar 2025
Language-Guided Trajectory Traversal in Disentangled Stable Diffusion Latent Space for Factorized Medical Image Generation
Zahra Tehraninasab
Amar Kumar
Tal Arbel
MedIm
54
0
0
30 Mar 2025
Diffusion Meets Few-shot Class Incremental Learning
Junsu Kim
Yunhoe Ku
Dongyoon Han
Seungryul Baek
DiffM
CLL
42
0
0
30 Mar 2025
Enhancing Creative Generation on Stable Diffusion-based Models
Jiyeon Han
Dahee Kwon
Gayoung Lee
Junho Kim
Jaesik Choi
DiffM
42
1
0
30 Mar 2025
A Large Scale Analysis of Gender Biases in Text-to-Image Generative Models
Leander Girrbach
Stephan Alaniz
Genevieve Smith
Zeynep Akata
40
0
0
30 Mar 2025
DiT4SR: Taming Diffusion Transformer for Real-World Image Super-Resolution
Zheng-Peng Duan
Jiawei Zhang
Xin Jin
Z. Zhang
Zheng Xiong
Dongqing Zou
Jimmy S. Ren
Chun-Le Guo
Chongyi Li
37
0
0
30 Mar 2025
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes
Nikai Du
Zhennan Chen
Z. Chen
Shan Gao
Xi Chen
Zhengkai Jiang
Jian Yang
Ying Tai
DiffM
38
0
0
30 Mar 2025
AI Agents in Engineering Design: A Multi-Agent Framework for Aesthetic and Aerodynamic Car Design
Mohamed Elrefaie
Janet Qian
Raina Wu
Qian Chen
Angela Dai
Faez Ahmed
AI4CE
41
0
0
30 Mar 2025
SketchVideo: Sketch-based Video Generation and Editing
Feng-Lin Liu
Hongbo Fu
Xintao Wang
Weicai Ye
Pengfei Wan
Di Zhang
Lin Gao
DiffM
VGen
40
0
0
30 Mar 2025
FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning
Hang Guo
Yawei Li
Taolin Zhang
J. Wang
Tao Dai
Shu-Tao Xia
Luca Benini
67
1
0
30 Mar 2025
ViT-Linearizer: Distilling Quadratic Knowledge into Linear-Time Vision Models
Guoyizhe Wei
Rama Chellappa
31
0
0
30 Mar 2025
Embedding Shift Dissection on CLIP: Effects of Augmentations on VLM's Representation Learning
Ashim Dahal
Saydul Akbar Murad
Nick Rahimi
VLM
45
0
0
30 Mar 2025
Object Isolated Attention for Consistent Story Visualization
Xiangyang Luo
Junhao Cheng
Yifan Xie
Xin Zhang
Tao Feng
Z. Liu
Fei Ma
Fei Richard Yu
DiffM
42
1
0
30 Mar 2025
Leveraging Vision-Language Foundation Models to Reveal Hidden Image-Attribute Relationships in Medical Imaging
Amar Kumar
Anita Kriz
Barak Pertzov
Tal Arbel
MedIm
51
0
0
30 Mar 2025
Learning Coordinated Bimanual Manipulation Policies using State Diffusion and Inverse Dynamics Models
Haonan Chen
Jiaming Xu
Lily Sheng
Tianchen Ji
Shuijing Liu
Yunzhu Li
Katherine Driggs-Campbell
57
1
0
30 Mar 2025
FreeInv: Free Lunch for Improving DDIM Inversion
Yuxiang Bao
Huijie Liu
Xun Gao
Huan Fu
Guoliang Kang
44
0
0
29 Mar 2025
On Geometrical Properties of Text Token Embeddings for Strong Semantic Binding in Text-to-Image Generation
H. Seo
Junseo Bang
Haechang Lee
Joohoon Lee
Byung Hyun Lee
Se Young Chun
46
0
0
29 Mar 2025
A GAN-Enhanced Deep Learning Framework for Rooftop Detection from Historical Aerial Imagery
Pengyu Chen
Sicheng Wang
Cuizhen Wang
Senrong Wang
Beiao Huang
Lu Huang
Zhe Zang
32
0
0
29 Mar 2025
SupertonicTTS: Towards Highly Scalable and Efficient Text-to-Speech System
H. Kim
Jinhyeok Yang
Yechan Yu
Seunghun Ji
Jacob Morton
Frederik Bous
Joon Byun
Juheon Lee
49
0
0
29 Mar 2025
MeshCraft: Exploring Efficient and Controllable Mesh Generation with Flow-based DiTs
Xianglong He
Junyi Chen
Di Huang
Zexiang Liu
Xiaoshui Huang
Wanli Ouyang
C. Yuan
Yangguang Li
DiffM
52
0
0
29 Mar 2025
Geometry in Style: 3D Stylization via Surface Normal Deformation
Nam Anh Dinh
Itai Lang
Hyunwoo Kim
Oded Stein
Rana Hanocka
3DH
41
0
0
29 Mar 2025
Exploiting Mixture-of-Experts Redundancy Unlocks Multimodal Generative Abilities
Raman Dutt
Harleen Hanspal
Guoxuan Xia
Petru-Daniel Tudosiu
Alexander Black
Yongxin Yang
Steven G. McDonagh
Sarah Parisot
MoE
38
0
0
28 Mar 2025
Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion
S. Yu
Yuxin Chen
Zhongang Qi
Zeke Xie
Yifan Wang
Lijun Wang
Ying Shan
Huchuan Lu
39
0
0
28 Mar 2025
DiTFastAttnV2: Head-wise Attention Compression for Multi-Modality Diffusion Transformers
H. Zhang
R. Su
Zhihang Yuan
Pengtao Chen
Mingzhu Shen Yibo Fan
Shengen Yan
Guohao Dai
Yu Wang
39
0
0
28 Mar 2025
Imperceptible but Forgeable: Practical Invisible Watermark Forgery via Diffusion Models
Ziping Dong
Chao Shuai
Zhongjie Ba
Peng Cheng
Z. Qin
Qinglong Wang
Kui Ren
WIGM
46
0
0
28 Mar 2025
DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness
Ruining Li
Chuanxia Zheng
Christian Rupprecht
Andrea Vedaldi
37
1
0
28 Mar 2025
Masked Self-Supervised Pre-Training for Text Recognition Transformers on Large-Scale Datasets
Martin Kiss
Michal Hradiš
34
0
0
28 Mar 2025
Concept-Aware LoRA for Domain-Aligned Segmentation Dataset Generation
Minho Park
S. Park
Jungsoo Lee
Hyojin Park
Kyuwoong Hwang
Fatih Porikli
Jaegul Choo
Sungha Choi
34
0
0
28 Mar 2025
Meta-LoRA: Meta-Learning LoRA Components for Domain-Aware ID Personalization
Barış Batuhan Topal
Umut Özyurt
Zafer Doğan Budak
Ramazan Gokberk Cinbis
45
0
0
28 Mar 2025
SIGHT: Single-Image Conditioned Generation of Hand Trajectories for Hand-Object Interaction
Alexey Gavryushin
Florian Redhardt
Gaia Di Lorenzo
Luc Van Gool
Marc Pollefeys
Kaichun Mo
Xi Wang
37
0
0
28 Mar 2025
One Look is Enough: A Novel Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation Models on High-Resolution Images
Byeongjun Kwon
Munchurl Kim
VLM
MDE
57
0
0
28 Mar 2025
High-Fidelity Diffusion Face Swapping with ID-Constrained Facial Conditioning
Dailan He
X. Wang
Shulun Wang
Guanglu Song
Bingqi Ma
Hao Shao
Y. Liu
Hongsheng Li
DiffM
60
0
0
28 Mar 2025
Diffusion models applied to skin and oral cancer classification
José J. M. Uliana
Renato A. Krohling
DiffM
MedIm
52
0
0
28 Mar 2025
Q-Insight: Understanding Image Quality via Visual Reinforcement Learning
Weiqi Li
X. Zhang
Shijie Zhao
Y. Zhang
Junlin Li
Li Zhang
Jian Andrew Zhang
46
3
0
28 Mar 2025
Zero4D: Training-Free 4D Video Generation From Single Video Using Off-the-Shelf Video Diffusion Model
Jangho Park
Taesung Kwon
Jong Chul Ye
VGen
39
0
0
28 Mar 2025
Follow Your Motion: A Generic Temporal Consistency Portrait Editing Framework with Trajectory Guidance
Haijie Yang
Z. Zhang
Hao Tang
Jianjun Qian
Jian Yang
DiffM
VGen
50
0
0
28 Mar 2025
Spatial Transport Optimization by Repositioning Attention Map for Training-Free Text-to-Image Synthesis
Woojung Han
Yeonkyung Lee
Chanyoung Kim
Kwanghyun Park
Seong Jae Hwang
DiffM
60
0
0
28 Mar 2025
Event-Based Distributed Linear Quadratic Gaussian for Multi-Robot Coordination with Localization Uncertainty
Tohid Kargar Tasooji
Sakineh Khodadadi
24
0
0
28 Mar 2025
Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments
Luke Rowe
Roger Girgis
Anthony Gosselin
Liam Paull
C. Pal
Felix Heide
DiffM
VGen
33
1
0
28 Mar 2025
Semantix: An Energy Guided Sampler for Semantic Style Transfer
Huiang He
Minghui Hu
C. Zheng
Chaoyue Wang
Tat-Jen Cham
DiffM
39
0
0
28 Mar 2025
EchoFlow: A Foundation Model for Cardiac Ultrasound Image and Video Generation
Hadrien Reynaud
Alberto Gomez
Paul Leeson
Qingjie Meng
B. Kainz
MedIm
54
0
0
28 Mar 2025
Previous
1
2
3
...
11
12
13
...
159
160
161
Next