ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.15282
  4. Cited By
Cascaded Diffusion Models for High Fidelity Image Generation
v1v2v3 (latest)

Cascaded Diffusion Models for High Fidelity Image Generation

Journal of machine learning research (JMLR), 2021
30 May 2021
Jonathan Ho
Chitwan Saharia
William Chan
David J. Fleet
Mohammad Norouzi
Tim Salimans
ArXiv (abs)PDFHTML

Papers citing "Cascaded Diffusion Models for High Fidelity Image Generation"

50 / 964 papers shown
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with
  Coarse-to-fine Pose-Reversible Guidance
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance
Guian Fang
Wenbiao Yan
Yuanfan Guo
J. N. Han
Zutao Jiang
Hang Xu
Shengcai Liao
Xiaodan Liang
207
12
0
09 Jul 2024
Image-Conditional Diffusion Transformer for Underwater Image Enhancement
Image-Conditional Diffusion Transformer for Underwater Image Enhancement
Xingyang Nie
Su Pan
Xiaoyu Zhai
Shifei Tao
Fengzhong Qu
Biao Wang
Huilin Ge
Guojie Xiao
181
3
0
07 Jul 2024
Multi-scale Conditional Generative Modeling for Microscopic Image
  Restoration
Multi-scale Conditional Generative Modeling for Microscopic Image Restoration
Luzhe Huang
Xiongye Xiao
Shixuan Li
Jiawen Sun
Yi Huang
Aydogan Ozcan
Paul Bogdan
MedImDiffM
196
5
0
07 Jul 2024
DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents
DisCo-Diff: Enhancing Continuous Diffusion Models with Discrete Latents
Yilun Xu
Gabriele Corso
Tommi Jaakkola
Arash Vahdat
Karsten Kreis
321
22
0
03 Jul 2024
A Comprehensive Survey on Diffusion Models and Their Applications
A Comprehensive Survey on Diffusion Models and Their Applications
M. Ahsan
S. Raman
Yingtao Liu
Zahed Siddique
MedImDiffM
392
6
0
01 Jul 2024
FORA: Fast-Forward Caching in Diffusion Transformer Acceleration
FORA: Fast-Forward Caching in Diffusion Transformer Acceleration
Pratheba Selvaraju
Tianyu Ding
Tianyi Chen
Ilya Zharkov
Luming Liang
398
67
0
01 Jul 2024
From Efficient Multimodal Models to World Models: A Survey
From Efficient Multimodal Models to World Models: A Survey
Xinji Mai
Zeng Tao
Junxiong Lin
Haoran Wang
Yang Chang
Yanlan Kang
Yan Wang
Wenqiang Zhang
306
14
0
27 Jun 2024
MultiDiff: Consistent Novel View Synthesis from a Single Image
MultiDiff: Consistent Novel View Synthesis from a Single Image
Norman Muller
Katja Schwarz
Barbara Roessle
Lorenzo Porzi
Samuel Rota Buló
Matthias Nießner
Peter Kontschieder
DiffM
302
53
0
26 Jun 2024
DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis
  through Structure Guidance
DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance
Younghyun Kim
Geunmin Hwang
Junyu Zhang
Eunbyung Park
692
26
0
26 Jun 2024
Diffusion Model-Based Video Editing: A Survey
Diffusion Model-Based Video Editing: A Survey
Wenhao Sun
Rong-Cheng Tu
Jingyi Liao
Dacheng Tao
VGen
330
36
0
26 Jun 2024
Long-Term Prediction Accuracy Improvement of Data-Driven Medium-Range
  Global Weather Forecast
Long-Term Prediction Accuracy Improvement of Data-Driven Medium-Range Global Weather Forecast
Yifan Hu
Fukang Yin
Weimin Zhang
Kaijun Ren
Junqiang Song
Kefeng Deng
Di Zhang
AI4Cl
157
0
0
26 Jun 2024
Toward Fairer Face Recognition Datasets
Toward Fairer Face Recognition Datasets
Alexandre Fournier-Mongieux
Michael Soumm
Adrian Daniel Popescu
B. Luvison
Hervé Le Borgne
224
0
0
24 Jun 2024
ResMaster: Mastering High-Resolution Image Generation via Structural and
  Fine-Grained Guidance
ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance
Shuwei Shi
Wenbo Li
Yuechen Zhang
Jingwen He
Biao Gong
Yinqiang Zheng
275
21
0
24 Jun 2024
Identifying and Solving Conditional Image Leakage in Image-to-Video
  Diffusion Model
Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model
Min Zhao
Hongzhou Zhu
Chendong Xiang
Kaiwen Zheng
Chongxuan Li
Jun Zhu
326
21
0
22 Jun 2024
LatentExplainer: Explaining Latent Representations in Deep Generative Models with Multimodal Large Language Models
LatentExplainer: Explaining Latent Representations in Deep Generative Models with Multimodal Large Language Models
Mengdan Zhu
Raasikh Kanjiani
Jiahui Lu
Andrew Choi
Qirui Ye
Liang Zhao
DiffM
425
0
0
21 Jun 2024
Consistency Models Made Easy
Consistency Models Made Easy
Zhengyang Geng
Ashwini Pokle
William Luo
Justin Lin
J. Zico Kolter
263
81
0
20 Jun 2024
Evaluating the design space of diffusion-based generative models
Evaluating the design space of diffusion-based generative modelsNeural Information Processing Systems (NeurIPS), 2024
Yuqing Wang
Ye He
Molei Tao
DiffM
367
17
0
18 Jun 2024
Mixing Natural and Synthetic Images for Robust Self-Supervised
  Representations
Mixing Natural and Synthetic Images for Robust Self-Supervised Representations
Reza Akbarian Bafghi
Nidhin Harilal
C. Monteleoni
M. Raissi
DiffM
282
1
0
18 Jun 2024
Diffusion Models in Low-Level Vision: A Survey
Diffusion Models in Low-Level Vision: A Survey
Chunming He
Yuqi Shen
Chengyu Fang
Fengyang Xiao
Longxiang Tang
Yulun Zhang
W. Zuo
Zhenhua Guo
Xiu Li
VLMDiffMMedIm
520
94
0
17 Jun 2024
Neural Pose Representation Learning for Generating and Transferring
  Non-Rigid Object Poses
Neural Pose Representation Learning for Generating and Transferring Non-Rigid Object PosesNeural Information Processing Systems (NeurIPS), 2024
Seungwoo Yoo
Juil Koo
Kyeongmin Yeo
Minhyuk Sung
3DHDRL
220
4
0
14 Jun 2024
Alleviating Distortion in Image Generation via Multi-Resolution
  Diffusion Models
Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models
Qihao Liu
Zhanpeng Zeng
Ju He
Qihang Yu
Xiaohui Shen
Liang-Chieh Chen
266
27
0
13 Jun 2024
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
OmniTokenizer: A Joint Image-Video Tokenizer for Visual Generation
Junke Wang
Yi Jiang
Zehuan Yuan
Binyue Peng
Zuxuan Wu
Yu-Gang Jiang
ViTVGen
310
81
0
13 Jun 2024
Dataset Enhancement with Instance-Level Augmentations
Dataset Enhancement with Instance-Level Augmentations
Orest Kupyn
Christian Rupprecht
268
17
0
12 Jun 2024
Diffusion-Promoted HDR Video Reconstruction
Diffusion-Promoted HDR Video Reconstruction
Yuanshen Guan
Ruikang Xu
Mingde Yao
Ruisheng Gao
Lizhi Wang
Zhiwei Xiong
192
2
0
12 Jun 2024
Hierarchical Patch Diffusion Models for High-Resolution Video Generation
Hierarchical Patch Diffusion Models for High-Resolution Video Generation
Ivan Skorokhodov
Willi Menapace
Aliaksandr Siarohin
Sergey Tulyakov
VGen
249
20
0
12 Jun 2024
Image Neural Field Diffusion Models
Image Neural Field Diffusion Models
Yinbo Chen
Oliver Wang
Richard Zhang
Eli Shechtman
Xiaolong Wang
Michael Gharbi
DiffM
291
11
0
11 Jun 2024
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with
  Foundation Models
Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models
Athanasios Tragakis
Marco Aversa
Chaitanya Kaul
Roderick Murray-Smith
Daniele Faccio
340
6
0
11 Jun 2024
Autoregressive Model Beats Diffusion: Llama for Scalable Image
  Generation
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation
Peize Sun
Yi Jiang
Shoufa Chen
Shilong Zhang
Bingyue Peng
Ping Luo
Zehuan Yuan
VLM
543
544
0
10 Jun 2024
Coherent Zero-Shot Visual Instruction Generation
Coherent Zero-Shot Visual Instruction Generation
Quynh Phung
Songwei Ge
Jia-Bin Huang
378
2
0
06 Jun 2024
DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D
  Data
DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D DataComputer Vision and Pattern Recognition (CVPR), 2024
Qihao Liu
Yi Zhang
Song Bai
Adam Kortylewski
Alan Yuille
266
17
0
06 Jun 2024
Shaping History: Advanced Machine Learning Techniques for the Analysis
  and Dating of Cuneiform Tablets over Three Millennia
Shaping History: Advanced Machine Learning Techniques for the Analysis and Dating of Cuneiform Tablets over Three Millennia
Danielle Kapon
Michael Fire
S. Gordin
316
2
0
06 Jun 2024
Bayesian Power Steering: An Effective Approach for Domain Adaptation of
  Diffusion Models
Bayesian Power Steering: An Effective Approach for Domain Adaptation of Diffusion Models
Ding Huang
Ting Li
Jian Huang
DiffM
252
1
0
06 Jun 2024
ED-SAM: An Efficient Diffusion Sampling Approach to Domain
  Generalization in Vision-Language Foundation Models
ED-SAM: An Efficient Diffusion Sampling Approach to Domain Generalization in Vision-Language Foundation Models
Thanh-Dat Truong
Pawan Sinha
Bhiksha Raj
Jackson Cothren
Khoa Luu
DiffMVLM
263
2
0
03 Jun 2024
Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Layout-Agnostic Scene Text Image Synthesis with Diffusion Models
Qilong Zhangli
Jindong Jiang
Di Liu
Licheng Yu
Xiaoliang Dai
Ankit Ramchandani
Guan Pang
Dimitris N. Metaxas
Praveen Krishnan
DiffM
465
15
0
03 Jun 2024
You Only Scan Once: Efficient Multi-dimension Sequential Modeling with
  LightNet
You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet
Zhen Qin
Yuxin Mao
Xuyang Shen
Dong Li
Jing Zhang
Yuchao Dai
Yiran Zhong
208
8
0
31 May 2024
MotionFollower: Editing Video Motion via Lightweight Score-Guided
  Diffusion
MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion
Shuyuan Tu
Jingdong Sun
Zihao Zhang
Sicheng Xie
Zhi-Qi Cheng
Chong Luo
Xintong Han
Zuxuan Wu
Yu-Gang Jiang
DiffMVGen
216
22
0
30 May 2024
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified
  Flow
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow
Chaoyang Wang
Xiangtai Li
Lu Qi
Henghui Ding
Yunhai Tong
Ming-Hsuan Yang
DiffM
301
13
0
30 May 2024
Patch-enhanced Mask Encoder Prompt Image Generation
Patch-enhanced Mask Encoder Prompt Image Generation
Shusong Xu
Peiye Liu
DiffM
173
1
0
29 May 2024
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
Lianghui Zhu
Zilong Huang
Bencheng Liao
Jun Hao Liew
Hanshu Yan
Jiashi Feng
Xinggang Wang
287
38
0
28 May 2024
Diffusion Rejection Sampling
Diffusion Rejection Sampling
Byeonghu Na
Yeongmin Kim
Minsang Park
DongHyeok Shin
Wanmo Kang
Il-Chul Moon
278
10
0
28 May 2024
FlowSDF: Flow Matching for Medical Image Segmentation Using Distance Transforms
FlowSDF: Flow Matching for Medical Image Segmentation Using Distance Transforms
L. Bogensperger
Dominik Narnhofer
Alexander Falk
Konrad Schindler
Thomas Pock
MedIm
521
11
0
28 May 2024
Vista: A Generalizable Driving World Model with High Fidelity and
  Versatile Controllability
Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
Shenyuan Gao
Jiazhi Yang
Li Chen
Kashyap Chitta
Yihang Qiu
Andreas Geiger
Jun Zhang
Hongyang Li
459
212
0
27 May 2024
$\text{Di}^2\text{Pose}$: Discrete Diffusion Model for Occluded 3D Human
  Pose Estimation
Di2Pose\text{Di}^2\text{Pose}Di2Pose: Discrete Diffusion Model for Occluded 3D Human Pose Estimation
Weiquan Wang
Jun Xiao
Chunping Wang
Wei Liu
Zhao Wang
Long Chen
DiffM
220
1
0
27 May 2024
Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models
Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models
C. N. Vasconcelos
Abdullah Rashwan Austin Waters
Trevor Walker
Keyang Xu
Jimmy Yan
...
Wenlei Zhou
Kevin Swersky
David J. Fleet
Jason Baldridge
Oliver Wang
208
4
0
27 May 2024
CHAMP: Conformalized 3D Human Multi-Hypothesis Pose Estimators
CHAMP: Conformalized 3D Human Multi-Hypothesis Pose Estimators
Harry Zhang
Luca Carlone
3DH
518
3
0
27 May 2024
Global Well-posedness and Convergence Analysis of Score-based Generative
  Models via Sharp Lipschitz Estimates
Global Well-posedness and Convergence Analysis of Score-based Generative Models via Sharp Lipschitz Estimates
Connor Mooney
Zhongjian Wang
Jack Xin
Yifeng Yu
243
3
0
25 May 2024
Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time Complexity
Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time ComplexityNeural Information Processing Systems (NeurIPS), 2024
Haoxuan Chen
Yinuo Ren
Lexing Ying
Grant M. Rotskoff
321
39
0
24 May 2024
SFDDM: Single-fold Distillation for Diffusion models
SFDDM: Single-fold Distillation for Diffusion models
Chi Hong
Jiyue Huang
Robert Birke
Dick H. J. Epema
Stefanie Roos
Lydia Y. Chen
207
1
0
23 May 2024
Semantica: An Adaptable Image-Conditioned Diffusion Model
Semantica: An Adaptable Image-Conditioned Diffusion Model
Manoj Kumar
N. Houlsby
Emiel Hoogeboom
DiffMVLM
375
0
0
23 May 2024
EditWorld: Simulating World Dynamics for Instruction-Following Image
  Editing
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Ling Yang
Bo-Wen Zeng
Jiaming Liu
Hong Li
Minghao Xu
Wentao Zhang
Shuicheng Yan
DiffM
218
30
0
23 May 2024
Previous
123...678...181920
Next