Cascaded Diffusion Models for High Fidelity Image Generation

Journal of Machine Learning Research (JMLR), 2021
30 May 2021
Jonathan Ho
Chitwan Saharia
William Chan
David J. Fleet
Mohammad Norouzi
Tim Salimans

Papers citing "Cascaded Diffusion Models for High Fidelity Image Generation"

50 / 964 papers shown
Reconstructing Multi-Scale Physical Fields from Extremely Sparse Measurements with an Autoencoder-Diffusion Cascade
Letian Yi
Tingpeng Zhang
Mingyuan Zhou
Guannan Wang
Quanke Su
Zhilu Lai
DiffM
56
0
0
01 Dec 2025
RoleMotion: A Large-Scale Dataset towards Robust Scene-Specific Role-Playing Motion Synthesis with Fine-grained Descriptions
Junran Peng
Yiheng Huang
Silei Shen
ZeJi Wei
Jingwei Yang
...
Yonghao He
Chuanchen Luo
M. Zhang
Xucheng Yin
Wei Sui
104
0
0
01 Dec 2025
Spatiotemporal Pyramid Flow Matching for Climate Emulation
Jeremy Irvin
Jiaqi Han
Z. Wang
Abdulaziz Alharbi
Yufei Zhao
Nomin-Erdene Bayarsaikhan
Daniele Visioni
A. Ng
Duncan Watson-Parris
AI4TS
84
0
0
01 Dec 2025
TalkingPose: Efficient Face and Gesture Animation with Feedback-guided Diffusion Model
Alireza Javanmardi
Pragati Jaiswal
T. Habtegebrial
Christen Millerdurai
Shaoxiang Wang
A. Pagani
Didier Stricker
DiffM
VGen
134
0
0
30 Nov 2025
NeuroVolve: Evolving Visual Stimuli toward Programmable Neural Objectives
Haomiao Chen
K. Jamison
M. Sabuncu
Amy Kuceyeski
DiffM
28
0
0
29 Nov 2025
Rethinking Cross-Generator Image Forgery Detection through DINOv3
Zhenglin Huang
Jason Li
Haiquan Wen
Tianxiao Li
Xi Yang
Lu Qi
Bei Peng
Xiaowei Huang
Ming-Hsuan Yang
Guangliang Cheng
69
0
0
27 Nov 2025
Flowing Backwards: Improving Normalizing Flows via Reverse Representation Alignment
Yang Chen
Xiaowei Xu
S. Wang
C. Zhu
Ruxue Wen
X. Li
Tiezheng Ge
Limin Wang
60
0
0
27 Nov 2025
Do You See What I Say? Generalizable Deepfake Detection based on Visual Speech Recognition
Maheswar Bora
Tashvik Dhamija
Shukesh Reddy
Baptiste Chopin
P. Balaji
Abhijit Das
A. Dantcheva
96
0
0
27 Nov 2025
PixelDiT: Pixel Diffusion Transformers for Image Generation
Yongsheng Yu
Wei Xiong
Weili Nie
Yichen Sheng
Shiqiu Liu
Jiebo Luo
266
0
0
25 Nov 2025
STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows
Jiatao Gu
Ying Shen
Tianrong Chen
Laurent Dinh
Y. Wang
Miguel Angel Bautista
David Berthelot
Josh Susskind
Shuangfei Zhai
DiffM
VGen
298
3
0
25 Nov 2025
MFM-point: Multi-scale Flow Matching for Point Cloud Generation
Petr Molodyk
Jaemoo Choi
David W. Romero
Ming-Yu Liu
Yongxin Chen
3DPC
228
0
0
25 Nov 2025
Training-Free Generation of Diverse and High-Fidelity Images via Prompt Semantic Space Optimization
Debin Meng
Chen Jin
Zheng Gao
Yanran Li
Ioannis Patras
Georgios Tzimiropoulos
DiffM
264
0
0
25 Nov 2025
Exo2EgoSyn: Unlocking Foundation Video Generation Models for Exocentric-to-Egocentric Video Synthesis
Mohammad Mahdi
Yuqian Fu
N. Savov
Jiancheng Pan
Danda Pani Paudel
Luc Van Gool
VGen
203
1
0
25 Nov 2025
One Attention, One Scale: Phase-Aligned Rotary Positional Embeddings for Mixed-Resolution Diffusion Transformer
Haoyu Wu
Jingyi Xu
Qiaomu Miao
Dimitris Samaras
H. Le
88
0
0
24 Nov 2025
DiP: Taming Diffusion Models in Pixel Space
Z. Chen
J. Zhu
Xu Chen
Jiangning Zhang
Xiaobin Hu
Hanzhen Zhao
C. Wang
Jian Yang
Ying Tai
280
0
0
24 Nov 2025
ActVAR: Activating Mixtures of Weights and Tokens for Efficient Visual Autoregressive Generation
Kaixin Zhang
Ruiqing Yang
Yuan Zhang
Shan You
Tao Huang
VLM
131
0
0
17 Nov 2025
Improved Masked Image Generation with Knowledge-Augmented Token Representations
Guotao Liang
Baoquan Zhang
Zhiyuan Wen
Zihao Han
Yunming Ye
120
0
0
15 Nov 2025
AvatarTex: High-Fidelity Facial Texture Reconstruction from Single-Image Stylized Avatars
Yuda Qiu
Zitong Xiao
Yiwei Zuo
Zisheng Ye
Weikai Chen
Xiaoguang Han
DiffM
180
0
0
10 Nov 2025
PercHead: Perceptual Head Model for Single-Image 3D Head Reconstruction & Editing
Antonio Oroz
Matthias Nießner
Tobias Kirschstein
100
2
0
04 Nov 2025
An efficient probabilistic hardware architecture for diffusion-like models
Andraž Jelinčič
Owen Lockwood
Akhil Garlapati
Guillaume Verdon
Trevor McCourt
DiffM
203
3
0
28 Oct 2025
See the Speaker: Crafting High-Resolution Talking Faces from Speech with Prior Guidance and Region Refinement
IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025
Jinting Wang
Jun Wang
Hei Victor Cheng
Li Liu
DiffM
112
0
0
28 Oct 2025
FARMER: Flow AutoRegressive Transformer over Pixels
Guangting Zheng
Qinyu Zhao
Tao Yang
Fei Xiao
Zhijie Lin
Jie Wu
Jiajun Deng
Y. Zhang
Rui Zhu
VGen
251
4
0
27 Oct 2025
Optimize Any Topology: A Foundation Model for Shape- and Resolution-Free Structural Topology Optimization
Amin Heyrani Nobari
Lyle Regenwetter
Cyril Picard
Ligong Han
F. Ahmed
136
0
0
26 Oct 2025
Blockwise Flow Matching: Improving Flow Matching Models For Efficient High-Quality Generation
Dogyun Park
Taehoon Lee
Minseok Joo
Hyunwoo J. Kim
117
1
0
24 Oct 2025
Improved Training Technique for Shortcut Models
Anh-Tien Nguyen
Viet-Anh Nguyen
D. Vu
T. Dao
Chi Tran
Toan M. Tran
Anh Tran
BDL
219
1
0
24 Oct 2025
Sprint: Sparse-Dense Residual Fusion for Efficient Diffusion Transformers
Dogyun Park
Moayed Haji-Ali
Yanyu Li
Willi Menapace
Sergey Tulyakov
Hyunwoo J. Kim
Aliaksandr Siarohin
Anil Kag
124
2
0
24 Oct 2025
PoseCrafter: Extreme Pose Estimation with Hybrid Video Synthesis
Qing Mao
Tianxin Huang
Yu Zhu
Jinqiu Sun
Y. Zhang
Gim Hee Lee
84
1
0
22 Oct 2025
Gradient Variance Reveals Failure Modes in Flow-Based Generative Models
Teodora Reu
Sixtine Dromigny
Michael M Bronstein
Francisco Vargas
204
1
0
20 Oct 2025
CanvasMAR: Improving Masked Autoregressive Video Generation With Canvas
Zian Li
Muhan Zhang
DiffM
VGen
146
0
0
15 Oct 2025
End-to-End Multi-Modal Diffusion Mamba
Chunhao Lu
Qiang Lu
Meichen Dong
Jake Luo
130
3
0
15 Oct 2025
LayerSync: Self-aligning Intermediate Layers
Yasaman Haghighi
B. V. Delft
Mariam Hassan
Alexandre Alahi
115
0
0
14 Oct 2025
There is No VAE: End-to-End Pixel-Space Generative Modeling via Self-Supervised Pre-training
Jiachen Lei
Keli Liu
Julius Berner
Haiming Yu
Hongkai Zheng
Jiahong Wu
Xiangxiang Chu
DiffM
261
2
0
14 Oct 2025
Efficient High-Resolution Image Editing with Hallucination-Aware Loss and Adaptive Tiling
Young D. Kwon
Abhinav Mehrotra
Malcolm Chadwick
Alberto Gil C. P. Ramos
S. Bhattacharya
DiffM
164
0
0
07 Oct 2025
Riddled basin geometry sets fundamental limits to predictability and reproducibility in deep learning
Andrew Ly
Pulin Gong
AI4CE
184
0
0
07 Oct 2025
Multi-scale Autoregressive Models are Laplacian, Discrete, and Latent Diffusion Models in Disguise
Steve Hong
Samuel Belkadi
DiffM
96
0
0
03 Oct 2025
Growing Visual Generative Capacity for Pre-Trained MLLMs
Hanyu Wang
Jiaming Han
Ziyan Yang
Qi Zhao
Shanchuan Lin
Xiangyu Yue
Abhinav Shrivastava
Zhenheng Yang
Hao Chen
VLM
195
0
0
02 Oct 2025
Purrception: Variational Flow Matching for Vector-Quantized Image Generation
Răzvan-Andrei Matişan
Vincent Tao Hu
Grigory Bartosh
Bjorn Ommer
Cees G. M. Snoek
Max Welling
Jan-Willem van de Meent
Mohammad Mahdi Derakhshani
Floor Eijkelboom
137
1
0
01 Oct 2025
Syntax-Guided Diffusion Language Models with User-Integrated Personalization
Ruqian Zhang
Yijiao Zhang
Juan Shen
Zhongyi Zhu
Annie Qu
DiffM
128
0
0
01 Oct 2025
Cascaded Diffusion Framework for Probabilistic Coarse-to-Fine Hand Pose Estimation
Taeyun Woo
Jinah Park
Tae-Kyun Kim
DiffM
144
0
0
01 Oct 2025
Query-Kontext: An Unified Multimodal Model for Image Generation and Editing
Yuxin Song
Wenkai Dong
Shizun Wang
Qi Zhang
Song Xue
...
H. Yang
Haocheng Feng
Hang Zhou
Xinyan Xiao
Jingdong Wang
DiffM
MLLM
153
2
0
30 Sep 2025
DiffAU: Diffusion-Based Ambisonics Upscaling
Amit Milstein
Stefano Rini
Boaz Rafaely
121
0
0
30 Sep 2025
OAT-FM: Optimal Acceleration Transport for Improved Flow Matching
Angxiao Yue
Anqi Dong
Hongteng Xu
OT
357
1
0
29 Sep 2025
Tumor Synthesis conditioned on Radiomics
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025
Jonghun Kim
Inye Na
Eun Sook Ko
Hyunjin Park
MedIm
200
1
0
29 Sep 2025
Tunable-Generalization Diffusion Powered by Self-Supervised Contextual Sub-Data for Low-Dose CT Reconstruction
Guoquan Wei
Liu Shi
Wenzhe Shan
Qiegen Liu
DiffM
MedIm
150
0
0
28 Sep 2025
Stochastic Interpolants via Conditional Dependent Coupling
Chenrui Ma
Xi Xiao
Tianyang Wang
Xiao Wang
Yanning Shen
DiffM
148
3
0
27 Sep 2025
HiGS: History-Guided Sampling for Plug-and-Play Enhancement of Diffusion Models
Seyedmorteza Sadat
Farnood Salehi
Romann M. Weber
DiffM
160
0
0
26 Sep 2025
Score-based Idempotent Distillation of Diffusion Models
Shehtab Zaman
Chengyan Liu
Kenneth Chiu
DiffM
148
0
0
25 Sep 2025
No Alignment Needed for Generation: Learning Linearly Separable Representations in Diffusion Models
Junno Yun
Yasar Utku Alçalar
Mehmet Akçakaya
117
1
0
25 Sep 2025
Audio Super-Resolution with Latent Bridge Models
Chang Li
Zehua Chen
Liyuan Wang
Jun Zhu
331
3
0
22 Sep 2025
Deep Learning Empowered Super-Resolution: A Comprehensive Survey and Future Prospects
Proceedings of the IEEE (Proc. IEEE), 2025
Le Zhang
Ao Li
Qibin Hou
Ce Zhu
Yonina C. Eldar
SupR
284
1
0
19 Sep 2025