ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.10752
  4. Cited By
High-Resolution Image Synthesis with Latent Diffusion Models

High-Resolution Image Synthesis with Latent Diffusion Models

20 December 2021
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
    3DV
ArXivPDFHTML

Papers citing "High-Resolution Image Synthesis with Latent Diffusion Models"

50 / 8,132 papers shown
Title
D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens
D2C: Unlocking the Potential of Continuous Autoregressive Image Generation with Discrete Tokens
Panpan Wang
Liqiang Niu
Fandong Meng
Jinan Xu
Yufeng Chen
Jie Zhou
DiffM
50
0
0
21 Mar 2025
DermDiff: Generative Diffusion Model for Mitigating Racial Biases in Dermatology Diagnosis
DermDiff: Generative Diffusion Model for Mitigating Racial Biases in Dermatology Diagnosis
Nusrat Munia
Abdullah-Al-Zubaer Imran
MedIm
47
1
0
21 Mar 2025
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer
Qingyu Shi
Jianzong Wu
Jinbin Bai
J. Zhang
Lu Qi
X. Li
Yunhai Tong
46
0
0
21 Mar 2025
FreeUV: Ground-Truth-Free Realistic Facial UV Texture Recovery via Cross-Assembly Inference Strategy
FreeUV: Ground-Truth-Free Realistic Facial UV Texture Recovery via Cross-Assembly Inference Strategy
Xingchao Yang
Takafumi Taketomi
Yuki Endo
Yoshihiro Kanamori
DiffM
46
0
0
21 Mar 2025
Auto-Regressive Diffusion for Generating 3D Human-Object Interactions
Auto-Regressive Diffusion for Generating 3D Human-Object Interactions
Zichen Geng
Zeeshan Hayder
W. Liu
Ajmal Saeed Mian
DiffM
VGen
61
0
0
21 Mar 2025
AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process
AnimatePainter: A Self-Supervised Rendering Framework for Reconstructing Painting Process
J. Hu
Shuyong Gao
Qianyu Guo
Yan Wang
Qishan Wang
Yuang Feng
Wenqiang Zhang
DiffM
VGen
47
0
0
21 Mar 2025
Bayesian generative models can flag performance loss, bias, and out-of-distribution image content
Bayesian generative models can flag performance loss, bias, and out-of-distribution image content
Miguel López-Pérez
M. Miani
Valery Naranjo
Søren Hauberg
Aasa Feragen
OOD
MedIm
54
0
0
21 Mar 2025
ProDehaze: Prompting Diffusion Models Toward Faithful Image Dehazing
ProDehaze: Prompting Diffusion Models Toward Faithful Image Dehazing
Tianwen Zhou
Jing Wang
Songtao Wu
Kuanhong Xu
DiffM
46
0
0
21 Mar 2025
FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields
FFaceNeRF: Few-shot Face Editing in Neural Radiance Fields
Kwan Yun
Chaelin Kim
Hangyeul Shin
Junyong Noh
CVBM
51
0
0
21 Mar 2025
Is there anything left? Measuring semantic residuals of objects removed from 3D Gaussian Splatting
Is there anything left? Measuring semantic residuals of objects removed from 3D Gaussian Splatting
Simona Kocour
Assia Benbihi
Aikaterini Adam
Torsten Sattler
3DPC
41
0
0
21 Mar 2025
Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors
Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors
Wonbong Jang
Philippe Weinzaepfel
Vincent Leroy
Lourdes Agapito
Jérôme Revaud
51
0
0
21 Mar 2025
UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models
UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models
Fanghua Yu
Jinjin Gu
Jinfan Hu
Zheyuan Li
Chao Dong
DiffM
52
0
0
21 Mar 2025
Real-Time Diffusion Policies for Games: Enhancing Consistency Policies with Q-Ensembles
Real-Time Diffusion Policies for Games: Enhancing Consistency Policies with Q-Ensembles
Ruoqi Zhang
Ziwei Luo
Jens Sjölund
Per Mattsson
Linus Gisslén
Alessandro Sestini
42
1
0
21 Mar 2025
Dereflection Any Image with Diffusion Priors and Diversified Data
Dereflection Any Image with Diffusion Priors and Diversified Data
Jichen Hu
Chen-Ning Yang
Zanwei Zhou
Jiemin Fang
Xiaokang Yang
Q. Tian
Wei-Ming Shen
44
0
0
21 Mar 2025
Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks
Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks
Bhishma Dedhia
David Bourgin
Krishna Kumar Singh
Yuheng Li
Yan Kang
Zhan Xu
N. Jha
Y. Liu
DiffM
VGen
72
0
0
21 Mar 2025
Halton Scheduler For Masked Generative Image Transformer
Halton Scheduler For Masked Generative Image Transformer
Victor Besnier
Mickael Chen
David Hurych
Eduardo Valle
Matthieu Cord
52
1
0
21 Mar 2025
R2LDM: An Efficient 4D Radar Super-Resolution Framework Leveraging Diffusion Model
R2LDM: An Efficient 4D Radar Super-Resolution Framework Leveraging Diffusion Model
Boyuan Zheng
Shouyi Lu
Renbo Huang
Minqing Huang
Fan Lu
Wei Tian
G. Zhuo
Lu Xiong
62
1
0
21 Mar 2025
Enabling Versatile Controls for Video Diffusion Models
Enabling Versatile Controls for Video Diffusion Models
Xu Zhang
Hao Zhou
Haoming Qin
Xiaobin Lu
Jiaxing Yan
Guanzhong Wang
Zeyu Chen
Yi Liu
DiffM
VGen
65
0
0
21 Mar 2025
Neuro-Symbolic Scene Graph Conditioning for Synthetic Image Dataset Generation
Neuro-Symbolic Scene Graph Conditioning for Synthetic Image Dataset Generation
Giacomo Savazzi
Eugenio Lomurno
Cristian Sbrolli
Agnese Chiatti
Matteo Matteucci
42
0
0
21 Mar 2025
Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval
Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval
Yuanmin Tang
Jing Yu
Keke Gai
Jiamin Zhuang
Gang Xiong
Gaopeng Gou
Qi Wu
VGen
49
1
0
21 Mar 2025
Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion Model
Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion Model
Yingying Fan
Quanwei Yang
Kaisiyuan Wang
Hang Zhou
Yingying Li
Haocheng Feng
Errui Ding
Y. Wu
J. Wang
DiffM
44
0
0
21 Mar 2025
PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning
PRIMAL: Physically Reactive and Interactive Motor Model for Avatar Learning
Yan Zhang
Yao Feng
Alpár Cseke
Nitin Saini
Nathan Bajandas
Nicolas Heron
M. Black
DiffM
VGen
64
0
0
21 Mar 2025
Shining Yourself: High-Fidelity Ornaments Virtual Try-on with Diffusion Model
Shining Yourself: High-Fidelity Ornaments Virtual Try-on with Diffusion Model
Yingmao Miao
Zhanpeng Huang
Rui Han
Zibin Wang
Chenhao Lin
Chao Shen
DiffM
47
0
0
20 Mar 2025
VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness
VerbDiff: Text-Only Diffusion Models with Enhanced Interaction Awareness
SeungJu Cha
Kwanyoung Lee
Ye-Chan Kim
Hyunwoo Oh
Dong-Jin Kim
48
0
0
20 Mar 2025
Reconstructing In-the-Wild Open-Vocabulary Human-Object Interactions
Reconstructing In-the-Wild Open-Vocabulary Human-Object Interactions
Boran Wen
Dingbang Huang
Zichen Zhang
J. Zhou
Jianbin Deng
Jingyu Gong
Yulong Chen
Lizhuang Ma
Y. Li
3DH
47
0
0
20 Mar 2025
Improving Autoregressive Image Generation through Coarse-to-Fine Token Prediction
Improving Autoregressive Image Generation through Coarse-to-Fine Token Prediction
Ziyao Guo
K. Zhang
Michael Qizhe Shieh
43
0
0
20 Mar 2025
UniCoRN: Latent Diffusion-based Unified Controllable Image Restoration Network across Multiple Degradations
UniCoRN: Latent Diffusion-based Unified Controllable Image Restoration Network across Multiple Degradations
Debabrata Mandal
Soumitri Chattopadhyay
Guansen Tong
Praneeth Chakravarthula
DiffM
52
0
0
20 Mar 2025
Controllable Segmentation-Based Text-Guided Style Editing
Controllable Segmentation-Based Text-Guided Style Editing
Jingwen Li
Aravind Chandrasekar
Mariana Rocha
Chao Li
Yuqing Chen
53
0
0
20 Mar 2025
Single Image Iterative Subject-driven Generation and Editing
Single Image Iterative Subject-driven Generation and Editing
Yair Shpitzer
Gal Chechik
Idan Schwartz
50
0
0
20 Mar 2025
Scale-wise Distillation of Diffusion Models
Scale-wise Distillation of Diffusion Models
Nikita Starodubcev
Denis Kuznedelev
Artem Babenko
Dmitry Baranchuk
DiffM
50
0
0
20 Mar 2025
Controlling Avatar Diffusion with Learnable Gaussian Embedding
Controlling Avatar Diffusion with Learnable Gaussian Embedding
Xuan Gao
Jingtao Zhou
Dongyu Liu
Yuqi Zhou
Juyong Zhang
3DGS
DiffM
46
0
0
20 Mar 2025
REVAL: A Comprehension Evaluation on Reliability and Values of Large Vision-Language Models
REVAL: A Comprehension Evaluation on Reliability and Values of Large Vision-Language Models
Jie M. Zhang
Zheng Yuan
Z. Wang
Bei Yan
Sibo Wang
Xiangkui Cao
Zonghui Guo
Shiguang Shan
Xilin Chen
ELM
44
0
0
20 Mar 2025
SynCity: Training-Free Generation of 3D Worlds
SynCity: Training-Free Generation of 3D Worlds
Paul Engstler
Aleksandar Shtedritski
Iro Laina
Christian Rupprecht
Andrea Vedaldi
3DGS
101
1
0
20 Mar 2025
VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling
VideoRFSplat: Direct Scene-Level Text-to-3D Gaussian Splatting Generation with Flexible Pose and Multi-View Joint Modeling
Hyojun Go
Byeongjun Park
Hyelin Nam
Byung-Hoon Kim
Hyungjin Chung
Changick Kim
3DGS
VGen
94
1
0
20 Mar 2025
Repurposing 2D Diffusion Models with Gaussian Atlas for 3D Generation
Repurposing 2D Diffusion Models with Gaussian Atlas for 3D Generation
Tiange Xiang
Kai Li
Chengjiang Long
Christian Hane
Peihong Guo
Scott Delp
Ehsan Adeli
L. Fei-Fei
DiffM
3DGS
47
0
0
20 Mar 2025
RL4Med-DDPO: Reinforcement Learning for Controlled Guidance Towards Diverse Medical Image Generation using Vision-Language Foundation Models
RL4Med-DDPO: Reinforcement Learning for Controlled Guidance Towards Diverse Medical Image Generation using Vision-Language Foundation Models
Parham Saremi
Amar Kumar
Mohammed Mohammed
Zahra Tehraninasab
Tal Arbel
LM&MA
MedIm
39
0
0
20 Mar 2025
MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving
MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving
Haiguang Wang
Daqi Liu
Hongwei Xie
Haisong Liu
Enhui Ma
Kaicheng Yu
Limin Wang
Bing Wang
VGen
72
0
0
20 Mar 2025
Learning 3D Scene Analogies with Neural Contextual Scene Maps
Learning 3D Scene Analogies with Neural Contextual Scene Maps
Junho Kim
Gwangtak Bae
E. Lee
Young Min Kim
3DPC
3DV
62
0
0
20 Mar 2025
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
Y. Wang
Zhijie Lin
Yao Teng
Yuanzhi Zhu
Shuhuai Ren
Jiashi Feng
Xihui Liu
53
0
0
20 Mar 2025
A Recipe for Generating 3D Worlds From a Single Image
A Recipe for Generating 3D Worlds From a Single Image
Katja Schwarz
Denys Rozumnyi
Samuel Rota Buló
Lorenzo Porzi
Peter Kontschieder
VGen
79
1
0
20 Mar 2025
EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation
EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation
Zihao Zhang
Haoran Chen
Haoyu Zhao
Guansong Lu
Yanwei Fu
Hang Xu
Zuxuan Wu
VGen
DiffM
62
0
0
20 Mar 2025
TriTex: Learning Texture from a Single Mesh via Triplane Semantic Features
TriTex: Learning Texture from a Single Mesh via Triplane Semantic Features
Dana Cohen-Bar
Daniel Cohen-Or
Gal Chechik
Yoni Kasten
42
0
0
20 Mar 2025
EDiT: Efficient Diffusion Transformers with Linear Compressed Attention
EDiT: Efficient Diffusion Transformers with Linear Compressed Attention
Philipp Becker
Abhinav Mehrotra
Ruchika Chavhan
Malcolm Chadwick
Luca Morreale
Mehdi Noroozi
Alberto Gil C. P. Ramos
Sourav Bhattacharya
46
0
0
20 Mar 2025
Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts
Temporal Score Analysis for Understanding and Correcting Diffusion Artifacts
Yu Cao
Zengqun Zhao
Ioannis Patras
Shaogang Gong
DiffM
48
0
0
20 Mar 2025
SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation
SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation
Chun-Han Yao
Yiming Xie
Vikram S. Voleti
Huaizu Jiang
Varun Jampani
3DGS
VGen
65
0
0
20 Mar 2025
Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation
Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation
Jiyuan Wang
Chunyu Lin
Cheng Guan
Lang Nie
Jing He
Haodong Li
K. Liao
Yao Zhao
DiffM
MDE
66
0
0
20 Mar 2025
M2N2V2: Multi-Modal Unsupervised and Training-free Interactive Segmentation
M2N2V2: Multi-Modal Unsupervised and Training-free Interactive Segmentation
Markus Karmann
Peng-Tao Jiang
Bo Li
O. Urfalioglu
42
0
0
20 Mar 2025
Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding
Cross-Modal and Uncertainty-Aware Agglomeration for Open-Vocabulary 3D Scene Understanding
Jinlong Li
Cristiano Saltori
Fabio Poiesi
N. Sebe
162
0
0
20 Mar 2025
BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers
BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers
Hui Zhang
Tingwei Gao
Jie Shao
Zuxuan Wu
69
0
0
20 Mar 2025
Tokenize Image as a Set
Tokenize Image as a Set
Zigang Geng
Mengde Xu
Han Hu
Shuyang Gu
DiffM
53
0
0
20 Mar 2025
Previous
123...151617...161162163
Next