Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2106.15282
Cited By
v1
v2
v3 (latest)
Cascaded Diffusion Models for High Fidelity Image Generation
Journal of machine learning research (JMLR), 2021
30 May 2021
Jonathan Ho
Chitwan Saharia
William Chan
David J. Fleet
Mohammad Norouzi
Tim Salimans
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Cascaded Diffusion Models for High Fidelity Image Generation"
50 / 964 papers shown
Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment
Bryan S Kim
Jeongsol Kim
Jong Chul Ye
388
3
0
24 May 2025
Forward-only Diffusion Probabilistic Models
Ziwei Luo
Fredrik K. Gustafsson
Jens Sjölund
Thomas B. Schön
406
0
0
22 May 2025
Conditional Panoramic Image Generation via Masked Autoregressive Modeling
Chaoyang Wang
Xiangtai Li
Lu Qi
X. Lin
Jinbin Bai
Qianyu Zhou
Yunhai Tong
DiffM
321
3
0
22 May 2025
Cascaded Diffusion Models for Neural Motion Planning
IEEE International Conference on Robotics and Automation (ICRA), 2025
Mohit Sharma
Adam Fishman
Vikash Kumar
Chris Paxton
Oliver Kroemer
220
1
0
21 May 2025
Learning to Integrate Diffusion ODEs by Averaging the Derivatives
Wenze Liu
Xiangyu Yue
402
4
0
20 May 2025
MVAR: Visual Autoregressive Modeling with Scale and Spatial Markovian Conditioning
Jinhua Zhang
Wei Long
Minghao Han
Weiyi You
Shuhang Gu
BDL
281
2
0
19 May 2025
Constraint-Aware Diffusion Guidance for Robotics: Real-Time Obstacle Avoidance for Autonomous Racing
Hao Ma
Sabrina Bodmer
Andrea Carron
Melanie Zeilinger
Michael Muehlebach
218
2
0
19 May 2025
Restoration Score Distillation: From Corrupted Diffusion Pretraining to One-Step High-Quality Generation
Yasi Zhang
Tianyu Chen
Zhendong Wang
Ying Nian Wu
Mingyuan Zhou
Oscar Leong
DiffM
221
2
0
19 May 2025
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling
Yuang Ai
Qihang Fan
Xuefeng Hu
Zhenheng Yang
Xiao-Yu Zhang
Huaibo Huang
DiffM
356
1
0
16 May 2025
One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework
Feiran Li
Qianqian Xu
Shilong Bao
Zhiyong Yang
Xiaochun Cao
Qingming Huang
DiffM
532
4
0
16 May 2025
Improving the Euclidean Diffusion Generation of Manifold Data by Mitigating Score Function Singularity
Ziqiang Liu
Wei Zhang
Tiejun Li
DiffM
393
1
0
15 May 2025
Good Things Come in Pairs: Paired Autoencoders for Inverse Problems
Matthias Chung
Bas Peters
Michael Solomon
268
4
0
10 May 2025
Computationally Efficient Diffusion Models in Medical Imaging: A Comprehensive Review
Abdullah
Wei Chen
Ickjai Lee
Euijoon Ahn
MedIm
424
2
0
09 May 2025
DiffVQA: Video Quality Assessment Using Diffusion Feature Extractor
Wei-Ting Chen
Yu-Jiet Vong
Yi-Tsung Lee
Sy-Yen Kuo
Qiang Gao
Sizhuo Ma
Jian Wang
1.0K
1
0
06 May 2025
No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves
Dengyang Jiang
Mengmeng Wang
Liuzhuozheng Li
Lei Zhang
Haoyu Wang
Wei Wei
Guang Dai
Yanning Zhang
Jingdong Wang
DiffM
531
15
0
05 May 2025
AI Alignment in Medical Imaging: Unveiling Hidden Biases Through Counterfactual Analysis
Haroui Ma
Francesco Quinzan
Theresa Willem
Stefan Bauer
279
0
0
28 Apr 2025
Emergence and Evolution of Interpretable Concepts in Diffusion Models
Berk Tinaz
Zalan Fabian
Mahdi Soltanolkotabi
DiffM
258
7
0
21 Apr 2025
TWIG: Two-Step Image Generation using Segmentation Masks in Diffusion Models
Mazharul Islam Rakib
Showrin Rahman
Joyanta Jyoti Mondal
Xi Xiao
David Lewis
Alessandra Mileo
Meem Arafat Manab
DiffM
342
0
0
21 Apr 2025
Diffusion-Driven Inertial Generated Data for Smartphone Location Classification
Noa Cohen
Rotem Dror
Itzik Klein
DiffM
166
0
0
20 Apr 2025
ARAP-GS: Drag-driven As-Rigid-As-Possible 3D Gaussian Splatting Editing with Diffusion Prior
Xiao Han
RunZe Tian
Yifei Tong
Fenggen Yu
Dingyao Liu
Yan Zhang
3DGS
252
1
0
17 Apr 2025
Image Editing with Diffusion Models: A Survey
Jia Wang
Jie Hu
Xiaoqi Ma
Hanghang Ma
Xiaoming Wei
Enhua Wu
322
5
0
17 Apr 2025
Anti-Aesthetics: Protecting Facial Privacy against Customized Text-to-Image Synthesis
Songping Wang
Yueming Lyu
Shiqi Liu
Ning Li
Tong Tong
Hao Sun
Caifeng Shan
PICV
414
0
0
16 Apr 2025
VideoPanda: Video Panoramic Diffusion with Multi-view Attention
Kevin Xie
Amirmojtaba Sabour
Jiahui Huang
Despoina Paschalidou
G. Klár
Umar Iqbal
Sanja Fidler
Fangyin Wei
VGen
MDE
407
4
0
15 Apr 2025
Efficient Generative Model Training via Embedded Representation Warmup
Deyuan Liu
Peng Sun
Xufeng Li
Tao Lin
471
0
0
14 Apr 2025
SD-ReID: View-aware Stable Diffusion for Aerial-Ground Person Re-Identification
Xiang Hu
Silong Yong
Yuhao Wang
Bin Yan
Huchuan Lu
323
5
0
13 Apr 2025
ZipIR: Latent Pyramid Diffusion Transformer for High-Resolution Image Restoration
Yongsheng Yu
Haitian Zheng
Zhifei Zhang
Jianming Zhang
Yuqian Zhou
Connelly Barnes
Yixiao Liu
Wei Xiong
Zhe Lin
Jiebo Luo
360
1
0
11 Apr 2025
Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos
Rundong Luo
Matthew Wallingford
Ali Farhadi
Noah Snavely
Wei-Chiu Ma
VGen
410
6
0
10 Apr 2025
PixelFlow: Pixel-Space Generative Models with Flow
Shoufa Chen
Chongjian Ge
Shilong Zhang
Peize Sun
Ping Luo
VLM
DRL
259
17
0
10 Apr 2025
Can You Count to Nine? A Human Evaluation Benchmark for Counting Limits in Modern Text-to-Video Models
Xuyang Guo
Zekai Huang
Jiayan Huo
Yingyu Liang
Zhenmei Shi
Zhao Song
Jiahao Zhang
ALM
VGen
490
13
0
05 Apr 2025
VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning
Xianwei Zhuang
Yuxin Xie
Yufan Deng
Dongchao Yang
Liming Liang
Jinghan Ru
Yuguo Yin
Yuexian Zou
341
15
0
03 Apr 2025
DiET-GS: Diffusion Prior and Event Stream-Assisted Motion Deblurring 3D Gaussian Splatting
Computer Vision and Pattern Recognition (CVPR), 2025
Seungjun Lee
Gim Hee Lee
3DGS
DiffM
270
4
0
31 Mar 2025
HiPART: Hierarchical Pose AutoRegressive Transformer for Occluded 3D Human Pose Estimation
Computer Vision and Pattern Recognition (CVPR), 2025
Hongwei Zheng
Han Li
Wenrui Dai
Ziyang Zheng
Chenglin Li
Junni Zou
Hongkai Xiong
3DH
255
6
0
30 Mar 2025
DC-SGD: Differentially Private SGD with Dynamic Clipping through Gradient Norm Distribution Estimation
IEEE Transactions on Information Forensics and Security (TIFS), 2025
Chengkun Wei
Weixian Li
Chen Gong
Wenzhi Chen
324
3
0
29 Mar 2025
SyncSDE: A Probabilistic Framework for Diffusion Synchronization
Computer Vision and Pattern Recognition (CVPR), 2025
Hyunjun Lee
Hyunsoo Lee
Sookwan Han
DiffM
457
1
0
27 Mar 2025
MAR-3D: Progressive Masked Auto-regressor for High-Resolution 3D Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Jinnan Chen
Lingting Zhu
Zeyu Hu
Shengju Qian
Yuxiao Chen
Xin Wang
G. Lee
497
6
0
26 Mar 2025
GIViC: Generative Implicit Video Compression
Ge Gao
Siyue Teng
Tianhao Peng
Fan Zhang
David Bull
DiffM
VGen
364
6
0
25 Mar 2025
Latent Space Super-Resolution for Higher-Resolution Image Generation with Diffusion Models
Computer Vision and Pattern Recognition (CVPR), 2025
Jinho Jeong
Sangmin Han
Jinwoo Kim
Seon Joo Kim
352
11
0
24 Mar 2025
Training-free Diffusion Acceleration with Bottleneck Sampling
Ye Tian
Xin Xia
Yuxi Ren
Shanchuan Lin
Xing Wang
Xuefeng Xiao
Yunhai Tong
L. Yang
Tengjiao Wang
469
8
0
24 Mar 2025
DiffusedWrinkles: A Diffusion-Based Model for Data-Driven Garment Animation
British Machine Vision Conference (BMVC), 2025
R. Vidaurre
Elena Garces
Dan Casas
DiffM
AI4CE
272
1
0
24 Mar 2025
HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis
Computer Vision and Pattern Recognition (CVPR), 2025
Mengtian Li
Jinshu Chen
Wanquan Feng
Bingchuan Li
Fei Dai
Mingcong Liu
Qian He
3DH
246
3
0
21 Mar 2025
Scale-wise Distillation of Diffusion Models
Nikita Starodubcev
Denis Kuznedelev
Artem Babenko
Dmitry Baranchuk
DiffM
287
4
0
20 Mar 2025
A Recipe for Generating 3D Worlds From a Single Image
Katja Schwarz
Denys Rozumnyi
Samuel Rota Buló
Lorenzo Porzi
Peter Kontschieder
VGen
275
8
0
20 Mar 2025
Improving Autoregressive Image Generation through Coarse-to-Fine Token Prediction
Ziyao Guo
Jianchao Tan
Michael Qizhe Shieh
209
5
0
20 Mar 2025
Generative Gaussian Splatting: Generating 3D Scenes with Video Diffusion Priors
Katja Schwarz
Norman Mueller
Peter Kontschieder
3DGS
301
11
0
17 Mar 2025
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
Computer Vision and Pattern Recognition (CVPR), 2025
Zijing Hu
Tai-wei Chang
Long Chen
Kun Kuang
Jiahui Li
Kaifeng Gao
Jun Xiao
X. Wang
Wenwu Zhu
EGVM
573
16
0
14 Mar 2025
RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models
Yijing Lin
Mengqi Huang
Shuhan Zhuang
Zhendong Mao
VGen
312
12
0
13 Mar 2025
Channel-wise Noise Scheduled Diffusion for Inverse Rendering in Indoor Scenes
Computer Vision and Pattern Recognition (CVPR), 2025
JunYong Choi
M. Sagong
SeokYeong Lee
Seung-Won Jung
Ig-Jae Kim
Junghyun Cho
DiffM
227
0
0
13 Mar 2025
Autoregressive Image Generation with Vision Full-view Prompt
Miaomiao Cai
G. Wang
Wei Li
Zhijun Tu
Hanting Chen
Shaohui Lin
Jie Hu
LRM
449
0
0
13 Mar 2025
SARA: Structural and Adversarial Representation Alignment for Training-efficient Diffusion Models
Hesen Chen
Junyan Wang
Zhiyu Tan
Hao Li
270
4
0
11 Mar 2025
Unleashing the Potential of Large Language Models for Text-to-Image Generation through Autoregressive Representation Alignment
Xing Xie
Jiawei Liu
Ziyue Lin
Huijie Fan
Zhi Han
Yandong Tang
Liangqiong Qu
425
0
0
10 Mar 2025
Previous
1
2
3
4
5
6
...
18
19
20
Next