2106.15282
Cascaded Diffusion Models for High Fidelity Image Generation
Journal of Machine Learning Research (JMLR), 2021
30 May 2021
Jonathan Ho
Chitwan Saharia
William Chan
David J. Fleet
Mohammad Norouzi
Tim Salimans
Papers citing
"Cascaded Diffusion Models for High Fidelity Image Generation"
50 / 964 papers shown
Title
Denoising Score Distillation: From Noisy Diffusion Pretraining to One-Step High-Quality Generation
Tianyu Chen
Yasi Zhang
Liang Luo
Ying Nian Wu
Oscar Leong
Mingyuan Zhou
DiffM
439
5
0
10 Mar 2025
NFIG: Multi-Scale Autoregressive Image Generation via Frequency Ordering
Zhihao Huang
Xi Qiu
Yukuo Ma
Yifu Zhou
Junjie Chen
Xuelong Li
Fangqiu Yi
VLM
373
2
0
10 Mar 2025
ControlFill: Spatially Adjustable Image Inpainting from Prompt Learning
Boseong Jeon
235
0
0
06 Mar 2025
Reconciling Stochastic and Deterministic Strategies for Zero-shot Image Restoration using Diffusion Model in Dual
Computer Vision and Pattern Recognition (CVPR), 2025
Chong-Jun Wang
Lanqing Guo
Zixuan Fu
Siyuan Yang
Hao Cheng
Alex C. Kot
Bihan Wen
DiffM
293
2
0
03 Mar 2025
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations
Computer Vision and Pattern Recognition (CVPR), 2025
Ziyang Zhang
Yang Yu
Yucheng Chen
Xulei Yang
S. Yeo
MedIm
516
9
0
02 Mar 2025
FlexVAR: Flexible Visual Autoregressive Modeling without Residual Prediction
Siyu Jiao
Gengwei Zhang
Yinlong Qian
Jiancheng Huang
Yao Zhao
Humphrey Shi
Lin Ma
Y. X. Wei
Zequn Jie
VLM
226
18
0
27 Feb 2025
GCDance: Genre-Controlled Music-Driven 3D Full Body Dance Generation
Xinran Liu
Xu Dong
Shenbin Qian
Diptesh Kanojia
Wenwu Wang
Zhenhua Feng
DiffM
355
0
0
25 Feb 2025
Improved Diffusion-based Generative Model with Better Adversarial Robustness
International Conference on Learning Representations (ICLR), 2025
Zekun Wang
Mingyang Yi
Shuchen Xue
Zhiyu Li
Ming Liu
Bing Qin
Zhi-Ming Ma
DiffM
337
0
0
24 Feb 2025
Human2Robot: Learning Robot Actions from Paired Human-Robot Videos
Sicheng Xie
Haidong Cao
Zejia Weng
Zhen Xing
Shiwei Shen
Jiaqi Leng
Yanwei Fu
Zuxuan Wu
415
9
0
23 Feb 2025
PFDiff: Training-Free Acceleration of Diffusion Models Combining Past and Future Scores
International Conference on Learning Representations (ICLR), 2024
Guangyi Wang
Yuren Cai
Lijiang Li
Wei Peng
Songzhi Su
DiffM
298
0
0
21 Feb 2025
Text-to-Image Rectified Flow as Plug-and-Play Priors
International Conference on Learning Representations (ICLR), 2024
Xiaofeng Yang
Cheng Chen
Xulei Yang
Fayao Liu
Guosheng Lin
DiffM
404
22
0
21 Feb 2025
SayAnything: Audio-Driven Lip Synchronization with Conditional Video Diffusion
Junxian Ma
Shiwen Wang
Jian Yang
Junyi Hu
Jian Liang
Guosheng Lin
Jingbo Chen
Kai Li
Yu Meng
DiffM
VGen
329
9
0
17 Feb 2025
Maximize Your Diffusion: A Study into Reward Maximization and Alignment for Diffusion-based Control
Dom Huh
P. Mohapatra
413
1
0
16 Feb 2025
E-MD3C: Taming Masked Diffusion Transformers for Efficient Zero-Shot Object Customization
T. Pham
Zhang Kang
Ji Woo Hong
Xuran Zheng
Chang D. Yoo
256
1
0
13 Feb 2025
Rolling Ahead Diffusion for Traffic Scene Simulation
Yunpeng Liu
Matthew Niedoba
William Harvey
Adam Scibior
Berend Zwartsenberg
Frank Wood
430
1
0
13 Feb 2025
PoGDiff: Product-of-Gaussians Diffusion Models for Imbalanced Text-to-Image Generation
Ziyan Wang
Sizhe Wei
Xiaoming Huo
Hao Wang
DiffM
518
1
0
12 Feb 2025
Satellite Observations Guided Diffusion Model for Accurate Meteorological States at Arbitrary Resolution
Computer Vision and Pattern Recognition (CVPR), 2025
Siwei Tu
Ben Fei
Weidong Yang
Zhangrui Li
Hao Chen
Zili Liu
Kun Chen
Hang Fan
W. Ouyang
Junlin Wu
475
6
0
09 Feb 2025
Revisiting Gradient-based Uncertainty for Monocular Depth Estimation
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Julia Hornauer
Amir El-Ghoussani
Vasileios Belagiannis
UQCV
273
3
0
09 Feb 2025
FreeBlend: Advancing Concept Blending with Staged Feedback-Driven Interpolation Diffusion
Yufan Zhou
Haoyu Shen
Huan Wang
DiffM
631
6
0
08 Feb 2025
DiffListener: Discrete Diffusion Model for Listener Generation
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Siyeol Jung
Taehwan Kim
65
3
0
05 Feb 2025
Assessing the use of Diffusion models for motion artifact correction in brain MRI
IEEE International Symposium on Biomedical Imaging (ISBI), 2025
Paolo Angella
Vito Paolo Pastore
Matteo Santacesaria
MedIm
DiffM
311
2
0
03 Feb 2025
Refining Alignment Framework for Diffusion Models with Intermediate-Step Preference Ranking
Jie Ren
Yuhang Zhang
Dongrui Liu
Xiaopeng Zhang
Qi Tian
273
5
0
01 Feb 2025
Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion
Computer Vision and Pattern Recognition (CVPR), 2025
Vitor Campagnolo Guizilini
Muhammad Zubair Irshad
Dian Chen
G. Shakhnarovich
Rares Andrei Ambrus
DiffM
272
7
0
30 Jan 2025
Slot-Guided Adaptation of Pre-trained Diffusion Models for Object-Centric Learning and Compositional Generation
International Conference on Learning Representations (ICLR), 2025
Adil Kaan Akan
Yucel Yemez
DiffM
OCL
396
6
0
27 Jan 2025
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model
Xianwei Zhuang
Yuxin Xie
Yufan Deng
Liming Liang
Jinghan Ru
Yuguo Yin
Yuexian Zou
MLLM
VLM
LRM
311
27
0
21 Jan 2025
Ditto: Accelerating Diffusion Model via Temporal Value Similarity
International Symposium on High-Performance Computer Architecture (HPCA), 2025
Sungbin Kim
Hyunwuk Lee
Wonho Cho
Mincheol Park
Won Woo Ro
411
8
0
20 Jan 2025
Generative diffusion model with inverse renormalization group flows
Kanta Masuki
Yuto Ashida
DiffM
264
4
0
17 Jan 2025
Likelihood Training of Cascaded Diffusion Models via Hierarchical Volume-preserving Maps
International Conference on Learning Representations (ICLR), 2025
Henry Li
Ronen Basri
Y. Kluger
DiffM
447
3
0
13 Jan 2025
EditAR: Unified Conditional Generation with Autoregressive Models
Computer Vision and Pattern Recognition (CVPR), 2025
Jiteng Mu
Nuno Vasconcelos
Xinyu Wang
DiffM
246
22
0
08 Jan 2025
MC-VTON: Minimal Control Virtual Try-On Diffusion Transformer
Junsheng Luan
Guangyuan Li
Lei Zhao
Wei Xing
DiffM
368
5
0
07 Jan 2025
Brick-Diffusion: Generating Long Videos with Brick-to-Wall Denoising
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025
Yunlong Yuan
Yuanfan Guo
Chunwei Wang
Hang Xu
Li Zhang
VGen
174
2
0
06 Jan 2025
Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis
Thang-Anh-Quan Nguyen
Nathan Piasco
Luis Roldão
Moussâb Bennehar
D. Tsishkou
Laurent Caraffa
J. Tarel
R. Brémond
DiffM
333
3
0
06 Jan 2025
DeTrack: In-model Latent Denoising Learning for Visual Object Tracking
Neural Information Processing Systems (NeurIPS), 2025
Xinyu Zhou
Jinglun Li
Lingyi Hong
Kaixun Jiang
Pinxue Guo
Weifeng Ge
Wenqiang Zhang
DiffM
245
7
0
05 Jan 2025
PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement
Chengyou Jia
Minnan Luo
Zhuohang Dang
Guangwen Dai
Xiao Chang
Jiangming Wang
DiffM
444
1
0
31 Dec 2024
DreamOmni: Unified Image Generation and Editing
Computer Vision and Pattern Recognition (CVPR), 2024
Bin Xia
Yuechen Zhang
Jingyao Li
Chengyao Wang
Yitong Wang
Xinglong Wu
Bei Yu
Jiaya Jia
SyDa
MLLM
365
16
0
22 Dec 2024
Next Patch Prediction for Autoregressive Visual Generation
Yatian Pang
Peng Jin
Shuo Yang
Bin Lin
Bin Zhu
...
Liuhan Chen
Francis E. H. Tay
Ser-Nam Lim
Harry Yang
Li Yuan
587
20
0
19 Dec 2024
Parallelized Autoregressive Visual Generation
Computer Vision and Pattern Recognition (CVPR), 2024
Yanjie Wang
Shuhuai Ren
Zhijie Lin
Yujin Han
Haoyuan Guo
Zhenheng Yang
Difan Zou
Jiashi Feng
Xihui Liu
VGen
630
35
0
19 Dec 2024
EditSplat: Multi-View Fusion and Attention-Guided Optimization for View-Consistent 3D Scene Editing with 3D Gaussian Splatting
Computer Vision and Pattern Recognition (CVPR), 2024
Dong In Lee
Hyeongcheol Park
Jiyoung Seo
Eunbyung Park
Hyunje Park
Ha Dam Baek
Shin Sangheon
Sangmin Kim
Sangpil Kim
3DGS
389
17
0
16 Dec 2024
CLIP-SR: Collaborative Linguistic and Image Processing for Super-Resolution
IEEE Transactions on Multimedia (IEEE TMM), 2024
Bingwen Hu
Heng Liu
Zhedong Zheng
Ping Liu
SupR
551
1
0
16 Dec 2024
Dual-Schedule Inversion: Training- and Tuning-Free Inversion for Real Image Editing
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Jiancheng Huang
Yi Huang
Jianzhuang Liu
Donghao Zhou
Wenshu Fan
Shifeng Chen
DiffM
293
8
0
15 Dec 2024
A Decade of Deep Learning: A Survey on The Magnificent Seven
Dilshod Azizov
Muhammad Arslan Manzoor
Velibor Bojkovic
Yingxu Wang
Liang Luo
...
Liang Li
Houcheng Su
Yu Zhong
Wei Liu
Shangsong Liang
OOD
AI4TS
MedIm
292
0
0
13 Dec 2024
CUPS: Improving Human Pose-Shape Estimators with Conformalized Deep Uncertainty
Harry Zhang
Luca Carlone
3DH
383
0
0
11 Dec 2024
Non-Normal Diffusion Models
Henry Li
VLM
DiffM
241
1
0
10 Dec 2024
[MASK] is All You Need
Vincent Tao Hu
Bjorn Ommer
DiffM
519
8
0
09 Dec 2024
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation
Computer Vision and Pattern Recognition (CVPR), 2024
Nicolas Dufour
David Picard
Vicky Kalogeiton
Loic Landrieu
227
14
0
09 Dec 2024
Nested Diffusion Models Using Hierarchical Latent Priors
Computer Vision and Pattern Recognition (CVPR), 2024
Xiao Zhang
Ruoxi Jiang
Rebecca Willett
Michael Maire
BDL
DiffM
363
1
0
08 Dec 2024
DIVE: Taming DINO for Subject-Driven Video Editing
Yi Huang
Wei Xiong
Chentao Song
Chaoqi Chen
Jianzhuang Liu
Mingfu Yan
Shifeng Chen
VGen
DiffM
334
7
0
04 Dec 2024
Generative modeling assisted simulation of measurement-altered quantum criticality
Yuchen Zhu
Molei Tao
Yuebo Jin
Xie Chen
209
1
0
02 Dec 2024
DisCoRD: Discrete Tokens to Continuous Motion via Rectified Flow Decoding
Jungbin Cho
Junwan Kim
Jisoo Kim
Minseo Kim
Mingu Kang
S. Hong
Tae-Hyun Oh
Youngjae Yu
VGen
552
6
0
29 Nov 2024
StableAnimator: High-Quality Identity-Preserving Human Image Animation
Computer Vision and Pattern Recognition (CVPR), 2024
Shuyuan Tu
Zhen Xing
Xintong Han
Zhi-Qi Cheng
Jingdong Sun
Chong Luo
Zuxuan Wu
VGen
548
54
0
26 Nov 2024