Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2212.09748
Cited By
v1
v2 (latest)
Scalable Diffusion Models with Transformers
IEEE International Conference on Computer Vision (ICCV), 2022
19 December 2022
William S. Peebles
Saining Xie
GNN
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (18 upvotes)
Papers citing
"Scalable Diffusion Models with Transformers"
50 / 2,711 papers shown
On the Design Fundamentals of Diffusion Models: A Survey
Pattern Recognition (Pattern Recogn.), 2023
Ziyi Chang
George Alex Koulieris
Hyung Jin Chang
Hubert P. H. Shum
DiffM
624
79
0
07 Jun 2023
PLANNER: Generating Diversified Paragraph via Latent Language Diffusion Model
Neural Information Processing Systems (NeurIPS), 2023
Yizhe Zhang
Jiatao Gu
Zhuofeng Wu
Shuangfei Zhai
J. Susskind
Navdeep Jaitly
DiffM
389
44
0
05 Jun 2023
SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds
Neural Information Processing Systems (NeurIPS), 2023
Yanyu Li
Huan Wang
Qing Jin
Ju Hu
Pavlo Chemerys
Yun Fu
Yanzhi Wang
Sergey Tulyakov
Jian Ren
VLM
339
234
0
01 Jun 2023
Coneheads: Hierarchy Aware Attention
Neural Information Processing Systems (NeurIPS), 2023
Albert Tseng
Tao Yu
Toni J.B. Liu
Chris De Sa
3DPC
264
7
0
01 Jun 2023
Addressing Negative Transfer in Diffusion Models
Neural Information Processing Systems (NeurIPS), 2023
Hyojun Go
Jinyoung Kim
Yunsung Lee
Seunghyun Lee
Shinhyeok Oh
Hyeongdon Moon
Seungtaek Choi
DiffM
VLM
545
31
0
01 Jun 2023
Humans in 4D: Reconstructing and Tracking Humans with Transformers
IEEE International Conference on Computer Vision (ICCV), 2023
Shubham Goel
Georgios Pavlakos
Jathushan Rajasegaran
Angjoo Kanazawa
Jitendra Malik
3DH
387
309
0
31 May 2023
A Unified Framework for U-Net Design and Analysis
Neural Information Processing Systems (NeurIPS), 2023
Christopher Williams
Fabian Falck
George Deligiannidis
Chris Holmes
Arnaud Doucet
Saifuddin Syed
SSeg
AI4CE
234
62
0
31 May 2023
Nested Diffusion Processes for Anytime Image Generation
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Noam Elata
Bahjat Kawar
T. Michaeli
Michael Elad
DiffM
251
6
0
30 May 2023
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation
Jia-Bin Huang
Yi Ren
Rongjie Huang
Dongchao Yang
Zhenhui Ye
Chen Zhang
Jinglin Liu
Xiang Yin
Zejun Ma
Zhou Zhao
DiffM
205
98
0
29 May 2023
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2023
Haoran He
Chenjia Bai
Kang Xu
Zhuoran Yang
Weinan Zhang
Dong Wang
Bingyan Zhao
Xuelong Li
DiffM
OffRL
352
135
0
29 May 2023
UDPM: Upsampling Diffusion Probabilistic Models
Neural Information Processing Systems (NeurIPS), 2023
Shady Abu Hussein
Raja Giryes
DiffM
394
4
0
25 May 2023
Knowledge Diffusion for Distillation
Neural Information Processing Systems (NeurIPS), 2023
Tao Huang
Yuan Zhang
Mingkai Zheng
Shan You
Fei Wang
Chao Qian
Chang Xu
323
87
0
25 May 2023
T1: Scaling Diffusion Probabilistic Fields to High-Resolution on Unified Visual Modalities
Kangfu Mei
Mo Zhou
Vishal M. Patel
DiffM
331
1
0
24 May 2023
VDT: General-purpose Video Diffusion Transformers via Mask Modeling
International Conference on Learning Representations (ICLR), 2023
Haoyu Lu
Guoxing Yang
Nanyi Fei
Yuqi Huo
Zhiwu Lu
Ping Luo
Mingyu Ding
DiffM
VGen
223
99
0
22 May 2023
U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech
Xin Jing
Yi Chang
Zijiang Yang
Jiang-jian Xie
Andreas Triantafyllopoulos
Bjoern W. Schuller
206
11
0
22 May 2023
Is Synthetic Data From Diffusion Models Ready for Knowledge Distillation?
Zheng Li
Yuxuan Li
Penghai Zhao
Renjie Song
Xiang Li
Jian Yang
197
24
0
22 May 2023
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Huadai Liu
Rongjie Huang
Xuan Lin
Wenqiang Xu
Maozong Zheng
Hong Chen
Jinzheng He
Zhou Zhao
DiffM
333
31
0
22 May 2023
Guided Motion Diffusion for Controllable Human Motion Synthesis
IEEE International Conference on Computer Vision (ICCV), 2023
Korrawe Karunratanakul
Konpat Preechakul
Supasorn Suwajanakorn
Siyu Tang
DiffM
425
204
0
21 May 2023
Learning Joint 2D & 3D Diffusion Models for Complete Molecule Generation
Han Huang
Leilei Sun
Bowen Du
Weifeng Lv
DiffM
286
22
0
21 May 2023
LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis
Yu Xie
Rui Li
Kaidong Zhang
Xin Luo
Dong Liu
DiffM
464
5
0
19 May 2023
Controllable Mind Visual Diffusion Model
AAAI Conference on Artificial Intelligence (AAAI), 2023
Bo-Wen Zeng
Shanglin Li
Xuhui Liu
Sicheng Gao
Xiaolong Jiang
Xu Tang
Feng-Long Xie
Jianzhuang Liu
Baochang Zhang
DiffM
223
37
0
17 May 2023
Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed Opportunity
International Conference on Medical Imaging with Deep Learning (MIDL), 2023
Raman Dutt
Linus Ericsson
Pedro Sanchez
Sotirios A. Tsaftaris
Timothy M. Hospedales
MedIm
482
73
0
14 May 2023
Visual Tuning
ACM Computing Surveys (ACM Comput. Surv.), 2023
Bruce X. B. Yu
Jianlong Chang
Haixin Wang
Lin Liu
Shijie Wang
...
Lingxi Xie
Haojie Li
Zhouchen Lin
Qi Tian
Chang Wen Chen
VLM
438
59
0
10 May 2023
BoDiffusion: Diffusing Sparse Observations for Full-Body Human Motion Synthesis
Angela Castillo
María Escobar
Guillaume Jeanneret
Albert Pumarola
Pablo Arbelaez
Ali K. Thabet
A. Sanakoyeu
DiffM
VGen
217
39
0
21 Apr 2023
Refusion: Enabling Large-Size Realistic Image Restoration with Latent-Space Diffusion Models
Ziwei Luo
Fredrik K. Gustafsson
Zhengli Zhao
Jens Sjölund
Thomas B. Schon
188
160
0
17 Apr 2023
Control3Diff: Learning Controllable 3D Diffusion Models from Single-view Images
International Conference on 3D Vision (3DV), 2023
Jiatao Gu
Qingzhe Gao
Shuangfei Zhai
Baoquan Chen
Lingjie Liu
J. Susskind
267
35
0
13 Apr 2023
DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning
IEEE International Conference on Computer Vision (ICCV), 2023
Enze Xie
Lewei Yao
Han Shi
Zhili Liu
Daquan Zhou
Zhaoqiang Liu
Jiawei Li
Zhenguo Li
615
91
0
13 Apr 2023
Intriguing properties of synthetic images: from generative adversarial networks to diffusion models
Riccardo Corvi
D. Cozzolino
Giovanni Poggi
Koki Nagano
L. Verdoliva
DiffM
313
142
0
13 Apr 2023
Revisiting the Evaluation of Image Synthesis with GANs
Neural Information Processing Systems (NeurIPS), 2023
Mengping Yang
Ceyuan Yang
Yichi Zhang
Qingyan Bai
Yujun Shen
Bo Dai
EGVM
278
10
0
04 Apr 2023
Your Diffusion Model is Secretly a Zero-Shot Classifier
IEEE International Conference on Computer Vision (ICCV), 2023
Alexander C. Li
Mihir Prabhudesai
Shivam Duggal
Ellis L Brown
Deepak Pathak
DiffM
VLM
686
309
0
28 Mar 2023
The Stable Signature: Rooting Watermarks in Latent Diffusion Models
IEEE International Conference on Computer Vision (ICCV), 2023
Pierre Fernandez
Guillaume Couairon
Edouard Grave
Matthijs Douze
Teddy Furon
WIGM
327
298
0
27 Mar 2023
PDPP: Projected Diffusion for Procedure Planning in Instructional Videos
Computer Vision and Pattern Recognition (CVPR), 2023
Hanlin Wang
Yilu Wu
Sheng Guo
Limin Wang
VGen
DiffM
423
34
0
26 Mar 2023
MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer
IEEE International Conference on Computer Vision (ICCV), 2023
Shanghua Gao
Pan Zhou
Mingg-Ming Cheng
Shuicheng Yan
DiffM
1.1K
248
0
25 Mar 2023
CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion
Geonmo Gu
Sanghyuk Chun
Wonjae Kim
HeeJae Jun
Yoohoon Kang
Sangdoo Yun
DiffM
550
77
0
21 Mar 2023
Polynomial Implicit Neural Representations For Large Diverse Datasets
Computer Vision and Pattern Recognition (CVPR), 2023
Rajhans Singh
Ankita Shukla
Pavan Turaga
AI4CE
202
28
0
20 Mar 2023
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
IEEE International Conference on Computer Vision (ICCV), 2023
Ligong Han
Yinxiao Li
Han Zhang
P. Milanfar
Dimitris N. Metaxas
Feng Yang
DiffM
668
367
0
20 Mar 2023
Denoising Diffusion Autoencoders are Unified Self-supervised Learners
IEEE International Conference on Computer Vision (ICCV), 2023
Weilai Xiang
Hongyu Yang
Di Huang
Yunhong Wang
DiffM
465
119
0
17 Mar 2023
Efficient Diffusion Training via Min-SNR Weighting Strategy
IEEE International Conference on Computer Vision (ICCV), 2023
Tiankai Hang
Shuyang Gu
Chen Li
Jianmin Bao
Dong Chen
Han Hu
Xin Geng
B. Guo
305
220
0
16 Mar 2023
ResDiff: Combining CNN and Diffusion Model for Image Super-Resolution
AAAI Conference on Artificial Intelligence (AAAI), 2023
Shuyao Shang
Zhengyang Shan
Guangxing Liu
LunQian Wang
XingHua Wang
Zekai Zhang
Jingling Zhang
DiffM
277
136
0
15 Mar 2023
Editing Implicit Assumptions in Text-to-Image Diffusion Models
IEEE International Conference on Computer Vision (ICCV), 2023
Hadas Orgad
Bahjat Kawar
Yonatan Belinkov
DiffM
363
115
0
14 Mar 2023
Scaling up GANs for Text-to-Image Synthesis
Computer Vision and Pattern Recognition (CVPR), 2023
Minguk Kang
Jun-Yan Zhu
Richard Y. Zhang
Jaesik Park
Eli Shechtman
Sylvain Paris
Taesung Park
325
597
0
09 Mar 2023
TRACT: Denoising Diffusion Models with Transitive Closure Time-Distillation
David Berthelot
Arnaud Autef
Jierui Lin
Dian Ang Yap
Shuangfei Zhai
Siyuan Hu
Daniel Zheng
Walter Talbot
Eric Gu
DiffM
249
119
0
07 Mar 2023
DLT: Conditioned layout generation with Joint Discrete-Continuous Diffusion Layout Transformer
IEEE International Conference on Computer Vision (ICCV), 2023
Elad Levi
Eli Brosh
Mykola Mykhailych
Meir Perez
DiffM
201
27
0
07 Mar 2023
Understanding Diffusion Objectives as the ELBO with Simple Data Augmentation
Neural Information Processing Systems (NeurIPS), 2023
Diederik P. Kingma
Ruiqi Gao
DiffM
744
238
0
01 Mar 2023
Unlimited-Size Diffusion Restoration
Yinhuai Wang
Jiwen Yu
Runyi Yu
Jian Zhang
192
16
0
01 Mar 2023
Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels
Neural Information Processing Systems (NeurIPS), 2023
Zebin You
Yong Zhong
Fan Bao
Jiacheng Sun
Chongxuan Li
Jun Zhu
DiffM
VLM
528
50
0
21 Feb 2023
A Reparameterized Discrete Diffusion Model for Text Generation
Lin Zheng
Jianbo Yuan
Lei Yu
Lingpeng Kong
DiffM
285
114
0
11 Feb 2023
Q-Diffusion: Quantizing Diffusion Models
IEEE International Conference on Computer Vision (ICCV), 2023
Xiuyu Li
Yijia Liu
Long Lian
Hua Yang
Zhen Dong
Daniel Kang
Shanghang Zhang
Kurt Keutzer
DiffM
MQ
374
237
0
08 Feb 2023
Structure and Content-Guided Video Synthesis with Diffusion Models
IEEE International Conference on Computer Vision (ICCV), 2023
Patrick Esser
Johnathan Chiu
Parmida Atighehchian
Jonathan Granskog
Anastasis Germanidis
DiffM
VGen
379
663
0
06 Feb 2023
Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?
International Conference on Machine Learning (ICML), 2023
Victor Boutin
Thomas Fel
Lakshya Singhal
Rishav Mukherji
Akash Nagaraj
Julien Colin
Thomas Serre
DiffM
274
10
0
27 Jan 2023
Previous
1
2
3
...
53
54
55
Next