High-Resolution Image Synthesis with Latent Diffusion Models

20 December 2021
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV

Papers citing "High-Resolution Image Synthesis with Latent Diffusion Models"

50 / 8,115 papers shown
Video Motion Graphs
Haiyang Liu
Zhan Xu
Fa-Ting Hong
Hsin-Ping Huang
Yi Zhou
Yang Zhou
DiffM
VGen
88
0
0
26 Mar 2025
Eyes Tell the Truth: GazeVal Highlights Shortcomings of Generative AI in Medical Imaging
David Wong
Bin Wang
Gorkem Durak
M. Tliba
Akshay S. Chaudhari
...
Eric Hart
Drew A Torigian
J. Udupa
Elizabeth A. Krupinski
Ulas Bagci
MedIm
34
0
0
26 Mar 2025
TD-BFR: Truncated Diffusion Model for Efficient Blind Face Restoration
Ziying Zhang
Xiang Gao
Zhixin Wang
Q. Hu
Xiaoyun Zhang
DiffM
84
0
0
26 Mar 2025
Contrastive Learning Guided Latent Diffusion Model for Image-to-Image Translation
Qi Si
Bo Wang
Zhao Zhang
68
0
0
26 Mar 2025
Disentangled Source-Free Personalization for Facial Expression Recognition with Neutral Target Data
Masoumeh Sharafi
Emma Ollivier
Muhammad Osama Zeeshan
Soufiane Belharbi
M. Pedersoli
A. L. Koerich
Simon L Bacon
Eric Granger
69
1
0
26 Mar 2025
EditCLIP: Representation Learning for Image Editing
Qian Wang
Aleksandar Cvejic
Abdelrahman Eldesokey
Peter Wonka
67
0
0
26 Mar 2025
Unified Multimodal Discrete Diffusion
Alexander Swerdlow
Mihir Prabhudesai
Siddharth Gandhi
Deepak Pathak
Katerina Fragkiadaki
DiffM
77
0
0
26 Mar 2025
DeClotH: Decomposable 3D Cloth and Human Body Reconstruction from a Single Image
Hyeongjin Nam
Donghwan Kim
Jeongtaek Oh
Kyoung Mu Lee
DiffM
3DH
45
0
0
25 Mar 2025
PCM : Picard Consistency Model for Fast Parallel Sampling of Diffusion Models
Junhyuk So
Jiwoong Shin
Chaeyeon Jang
Eunhyeok Park
DiffM
48
0
0
25 Mar 2025
Learning Hazing to Dehazing: Towards Realistic Haze Generation for Real-World Image Dehazing
Ruiyi Wang
Yushuo Zheng
Zicheng Zhang
Chunyi Li
Shuaicheng Liu
Guangtao Zhai
Xiaohong Liu
DiffM
49
0
0
25 Mar 2025
PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model
Mingju Gao
Yike Pan
Huan-ang Gao
Zongzheng Zhang
Wenyi Li
Hao Dong
Hao Tang
Li Yi
Hao Zhao
VGen
40
0
0
25 Mar 2025
LayerCraft: Enhancing Text-to-Image Generation with CoT Reasoning and Layered Object Integration
Yuyao Zhang
Jinghao Li
Yu-Wing Tai
DiffM
64
0
0
25 Mar 2025
Interpretable Generative Models through Post-hoc Concept Bottlenecks
Akshay Kulkarni
Ge Yan
Chung-En Sun
Tuomas P. Oikarinen
Tsui-Wei Weng
45
0
0
25 Mar 2025
SITA: Structurally Imperceptible and Transferable Adversarial Attacks for Stylized Image Generation
Jingdan Kang
Haoxin Yang
Yan Cai
Huaidong Zhang
Xuemiao Xu
Yong Du
Shengfeng He
AAML
44
0
0
25 Mar 2025
Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models
K. Thakral
Tamar Glaser
Tal Hassner
Mayank Vatsa
Richa Singh
44
2
0
25 Mar 2025
LangBridge: Interpreting Image as a Combination of Language Embeddings
Jiaqi Liao
Yuwei Niu
Fanqing Meng
Hao Li
Changyao Tian
...
Dianqi Li
X. Zhu
Li Yuan
Jifeng Dai
Yu Cheng
MLLM
72
0
0
25 Mar 2025
ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning
Jiaqi Liao
Z. Yang
Linjie Li
Dianqi Li
Kevin Qinghong Lin
Yu-Xi Cheng
Lijuan Wang
MLLM
LRM
57
0
0
25 Mar 2025
Reverse Prompt: Cracking the Recipe Inside Text-to-Image Generation
Zhiyao Ren
Yibing Zhan
B. Yu
Dacheng Tao
DiffM
69
0
0
25 Mar 2025
FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model
Jun Zhou
J. Li
Zunnan Xu
Hanhui Li
Yiji Cheng
Fa-Ting Hong
Qin Lin
Qinglin Lu
Xiaodan Liang
DiffM
67
1
0
25 Mar 2025
ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models
Fernando Julio Cendra
Kai Han
VLM
51
0
0
25 Mar 2025
IPGO: Indirect Prompt Gradient Optimization for Parameter-Efficient Prompt-level Fine-Tuning on Text-to-Image Models
Jianping Ye
Michel Wedel
Kunpeng Zhang
37
0
0
25 Mar 2025
Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing
Jaihoon Kim
Taehoon Yoon
Jisung Hwang
Minhyuk Sung
DiffM
54
1
0
25 Mar 2025
TeLL Me what you cant see
Saverio Cavasin
Pietro Biasetton
Mattia Tamiazzo
Mauro Conti
Simone Milani
DiffM
40
0
0
25 Mar 2025
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy
Zhi Hou
Tianyi Zhang
Yuwen Xiong
Haonan Duan
Hengjun Pu
...
Chengyang Zhao
X. Zhu
Yu Qiao
Jifeng Dai
Y. Chen
59
1
0
25 Mar 2025
AvatarArtist: Open-Domain 4D Avatarization
Hongyu Liu
Xuan Wang
Ziyu Wan
Yue Ma
Jingye Chen
Yanbo Fan
Yujun Shen
Yibing Song
Qifeng Chen
41
0
0
25 Mar 2025
CoSimGen: Controllable Diffusion Model for Simultaneous Image and Mask Generation
Rupak Bose
C. Nwoye
Aditya Bhat
N. Padoy
DiffM
MedIm
42
0
0
25 Mar 2025
VectorFit : Adaptive Singular & Bias Vector Fine-Tuning of Pre-trained Foundation Models
Suhas G Hegde
S. K
Aruna Tiwari
54
0
0
25 Mar 2025
SparseGS-W: Sparse-View 3D Gaussian Splatting in the Wild with Generative Priors
Yiqing Li
X. Wang
Jiawei Wu
Yikun Ma
Zhi Jin
3DGS
39
0
0
25 Mar 2025
Correcting Deviations from Normality: A Reformulated Diffusion Model for Multi-Class Unsupervised Anomaly Detection
Farzad Beizaee
Gregory A. Lodygensky
Christian Desrosiers
Jose Dolz
36
0
0
25 Mar 2025
PAVE: Patching and Adapting Video Large Language Models
Zhuoming Liu
Yiquan Li
Khoi Duc Nguyen
Yiwu Zhong
Yin Li
KELM
LRM
79
0
0
25 Mar 2025
Single-Step Latent Consistency Model for Remote Sensing Image Super-Resolution
Xiaohui Sun
Jiangwei Mo
Hanlin Wu
Jie Ma
38
0
0
25 Mar 2025
Towards Robust Time-of-Flight Depth Denoising with Confidence-Aware Diffusion Model
Changyong He
Jin Zeng
Jiawei Zhang
Jiajie Guo
DiffM
44
0
0
25 Mar 2025
AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset
Haiyu Zhang
Xinyuan Chen
Yaohui Wang
Xihui Liu
Yunhong Wang
Yu Qiao
VGen
62
0
0
25 Mar 2025
EfficientMT: Efficient Temporal Adaptation for Motion Transfer in Text-to-Video Diffusion Models
Yufei Cai
Hu Han
Yuxiang Wei
Shiguang Shan
Xilin Chen
DiffM
VGen
65
0
0
25 Mar 2025
Scaling Down Text Encoders of Text-to-Image Diffusion Models
Lifu Wang
Daqing Liu
Xinchen Liu
Xiaodong He
VLM
38
0
0
25 Mar 2025
Analyzable Chain-of-Musical-Thought Prompting for High-Fidelity Music Generation
Max W. Y. Lam
Yijin Xing
Weiya You
Jingcheng Wu
Zongyu Yin
...
T. Zhao
Chien-Hung Liu
Xuchen Song
Yang Li
Yahui Zhou
LRM
56
2
0
25 Mar 2025
ORION: A Holistic End-to-End Autonomous Driving Framework by Vision-Language Instructed Action Generation
Haoyu Fu
Diankun Zhang
Zongchuang Zhao
Jianfeng Cui
Dingkang Liang
Chong Zhang
Dingyuan Zhang
Hongwei Xie
Bing Wang
Xiang Bai
38
2
0
25 Mar 2025
Surface-Aware Distilled 3D Semantic Features
Lukas Uzolas
E. Eisemann
Petr Kellnhofer
3DPC
3DH
78
0
0
24 Mar 2025
ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation
Guosheng Zhao
Xiaofeng Wang
Chaojun Ni
Zheng Zhu
Wenkang Qin
Guan Huang
Xingang Wang
71
1
0
24 Mar 2025
Compositional Caching for Training-free Open-vocabulary Attribute Detection
Marco Garosi
Alessandro Conti
Gaowen Liu
Elisa Ricci
Massimiliano Mancini
ObjD
VLM
50
0
0
24 Mar 2025
DisentTalk: Cross-lingual Talking Face Generation via Semantic Disentangled Diffusion Model
Kangwei Liu
Junwu Liu
Yun Cao
Jinlin Guo
Xiaowei Yi
DiffM
41
0
0
24 Mar 2025
Color Conditional Generation with Sliced Wasserstein Guidance
Alexander Lobashev
Maria Larchenko
Dmitry Guskov
DiffM
43
0
0
24 Mar 2025
PALATE: Peculiar Application of the Law of Total Expectation to Enhance the Evaluation of Deep Generative Models
Tadeusz Dziarmaga
Marcin Kądziołka
Artur Kasymov
Marcin Mazur
EGVM
100
0
0
24 Mar 2025
MuMA: 3D PBR Texturing via Multi-Channel Multi-View Generation and Agentic Post-Processing
Lingting Zhu
Jingrui Ye
Runze Zhang
Zeyu Hu
Yingda Yin
...
Jinnan Chen
Shengju Qian
Xin Wang
Qingmin Liao
L. Yu
52
2
0
24 Mar 2025
Diff-Palm: Realistic Palmprint Generation with Polynomial Creases and Intra-Class Variation Controllable Diffusion Models
Jianlong Jin
Chenglong Zhao
Ruixin Zhang
Sheng Shang
Jianqing Xu
...
Shaoming Wang
Yang Zhao
Shouhong Ding
Wei Jia
Yunsheng Wu
152
0
0
24 Mar 2025
Boosting Resolution Generalization of Diffusion Transformers with Randomized Positional Encodings
Cong Liu
Liang Hou
Mingwu Zheng
Xin Tao
Pengfei Wan
Di Zhang
Kun Gai
49
0
0
24 Mar 2025
Panorama Generation From NFoV Image Done Right
Dian Zheng
Cheng Zhang
Xiao-Ming Wu
Cao Li
Chengfei Lv
Jian-Fang Hu
Wei-Shi Zheng
DiffM
81
0
0
24 Mar 2025
Uncertainty-guided Perturbation for Image Super-Resolution Diffusion Model
Leheng Zhang
Weiyi You
Kexuan Shi
Shuhang Gu
57
0
0
24 Mar 2025
Generative Dataset Distillation using Min-Max Diffusion Model
Junqiao Fan
Yunjiao Zhou
Min Chang Jordan Ren
Jianfei Yang
DiffM
63
0
0
24 Mar 2025
Coeff-Tuning: A Graph Filter Subspace View for Tuning Attention-Based Large Models
Zichen Miao
Wei Chen
Qiang Qiu
90
1
0
24 Mar 2025