v1v2v3 (latest)

Cascaded Diffusion Models for High Fidelity Image Generation

Journal of machine learning research (JMLR), 2021

30 May 2021

David J. Fleet

Papers citing "Cascaded Diffusion Models for High Fidelity Image Generation"

50 / 964 papers shown

Reconstructing Multi-Scale Physical Fields from Extremely Sparse Measurements with an Autoencoder-Diffusion Cascade

01 Dec 2025

RoleMotion: A Large-Scale Dataset towards Robust Scene-Specific Role-Playing Motion Synthesis with Fine-grained Descriptions

...

104

01 Dec 2025

Spatiotemporal Pyramid Flow Matching for Climate Emulation

Nomin-Erdene Bayarsaikhan

01 Dec 2025

TalkingPose: Efficient Face and Gesture Animation with Feedback-guided Diffusion Model

134

30 Nov 2025

NeuroVolve: Evolving Visual Stimuli toward Programmable Neural Objectives

29 Nov 2025

Rethinking Cross-Generator Image Forgery Detection through DINOv3

27 Nov 2025

Flowing Backwards: Improving Normalizing Flows via Reverse Representation Alignment

27 Nov 2025

Do You See What I Say? Generalizable Deepfake Detection based on Visual Speech Recognition

27 Nov 2025

PixelDiT: Pixel Diffusion Transformers for Image Generation

266

25 Nov 2025

STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows

Miguel Angel Bautista

298

25 Nov 2025

MFM-point: Multi-scale Flow Matching for Point Cloud Generation

228

25 Nov 2025

Training-Free Generation of Diverse and High-Fidelity Images via Prompt Semantic Space Optimization

Georgios Tzimiropoulos

DiffM

264

25 Nov 2025

Exo2EgoSyn: Unlocking Foundation Video Generation Models for Exocentric-to-Egocentric Video Synthesis

203

25 Nov 2025

One Attention, One Scale: Phase-Aligned Rotary Positional Embeddings for Mixed-Resolution Diffusion Transformer

24 Nov 2025

DiP: Taming Diffusion Models in Pixel Space

280

24 Nov 2025

ActVAR: Activating Mixtures of Weights and Tokens for Efficient Visual Autoregressive Generation

131

17 Nov 2025

Improved Masked Image Generation with Knowledge-Augmented Token Representations

120

15 Nov 2025

AvatarTex: High-Fidelity Facial Texture Reconstruction from Single-Image Stylized Avatars

180

10 Nov 2025

PercHead: Perceptual Head Model for Single-Image 3D Head Reconstruction & Editing

Antonio Oroz

Matthias Nießner

Tobias Kirschstein

100

04 Nov 2025

An efficient probabilistic hardware architecture for diffusion-like models

Guillaume Verdon

Trevor McCourt

DiffM

203

28 Oct 2025

See the Speaker: Crafting High-Resolution Talking Faces from Speech with Prior Guidance and Region RefinementIEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2025

112

28 Oct 2025

FARMER: Flow AutoRegressive Transformer over Pixels

251

27 Oct 2025

Optimize Any Topology: A Foundation Model for Shape- and Resolution-Free Structural Topology Optimization

136

26 Oct 2025

Blockwise Flow Matching: Improving Flow Matching Models For Efficient High-Quality Generation

117

24 Oct 2025

Improved Training Technique for Shortcut Models

219

24 Oct 2025

Sprint: Sparse-Dense Residual Fusion for Efficient Diffusion Transformers

124

24 Oct 2025

PoseCrafter: Extreme Pose Estimation with Hybrid Video Synthesis

22 Oct 2025

Gradient Variance Reveals Failure Modes in Flow-Based Generative Models

204

20 Oct 2025

CanvasMAR: Improving Masked Autoregressive Video Generation With Canvas

Zian Li

Muhan Zhang

DiffM VGen

146

15 Oct 2025

End-to-End Multi-Modal Diffusion Mamba

130

15 Oct 2025

LayerSync: Self-aligning Intermediate Layers

115

14 Oct 2025

There is No VAE: End-to-End Pixel-Space Generative Modeling via Self-Supervised Pre-training

261

14 Oct 2025

Efficient High-Resolution Image Editing with Hallucination-Aware Loss and Adaptive Tiling

Young D. Kwon

Abhinav Mehrotra

Malcolm Chadwick

Alberto Gil C. P. Ramos

S. Bhattacharya

DiffM

164

07 Oct 2025

Riddled basin geometry sets fundamental limits to predictability and reproducibility in deep learning

Andrew Ly

Pulin Gong

AI4CE

184

07 Oct 2025

Multi-scale Autoregressive Models are Laplacian, Discrete, and Latent Diffusion Models in Disguise

Steve Hong

Samuel Belkadi

DiffM

03 Oct 2025

Growing Visual Generative Capacity for Pre-Trained MLLMs

195

02 Oct 2025

Purrception: Variational Flow Matching for Vector-Quantized Image Generation

Răzvan-Andrei Matişan

Jan-Willem van de Meent

Mohammad Mahdi Derakhshani

Floor Eijkelboom

137

01 Oct 2025

Syntax-Guided Diffusion Language Models with User-Integrated Personalization

128

01 Oct 2025

Cascaded Diffusion Framework for Probabilistic Coarse-to-Fine Hand Pose Estimation

144

01 Oct 2025

Query-Kontext: An Unified Multimodal Model for Image Generation and Editing

...

153

30 Sep 2025

DiffAU: Diffusion-Based Ambisonics Upscaling

Amit Milstein

Stefano Rini

Boaz Rafaely

121

30 Sep 2025

OAT-FM: Optimal Acceleration Transport for Improved Flow Matching

Angxiao Yue

Anqi Dong

Hongteng Xu

357

29 Sep 2025

Tumor Synthesis conditioned on RadiomicsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2025

200

29 Sep 2025

Tunable-Generalization Diffusion Powered by Self-Supervised Contextual Sub-Data for Low-Dose CT Reconstruction

150

28 Sep 2025

Stochastic Interpolants via Conditional Dependent Coupling

148

27 Sep 2025

HiGS: History-Guided Sampling for Plug-and-Play Enhancement of Diffusion Models

160

26 Sep 2025

Score-based Idempotent Distillation of Diffusion Models

148

25 Sep 2025

No Alignment Needed for Generation: Learning Linearly Separable Representations in Diffusion Models

Junno Yun

Yasar Utku Alçalar

Mehmet Akçakaya

117

25 Sep 2025

Audio Super-Resolution with Latent Bridge Models

331

22 Sep 2025

Deep Learning Empowered Super-Resolution: A Comprehensive Survey and Future ProspectsProceedings of the IEEE (Proc. IEEE), 2025

284

19 Sep 2025