Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2212.09748
Cited By
v1
v2 (latest)
Scalable Diffusion Models with Transformers
IEEE International Conference on Computer Vision (ICCV), 2022
19 December 2022
William S. Peebles
Saining Xie
GNN
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (18 upvotes)
Papers citing
"Scalable Diffusion Models with Transformers"
50 / 2,712 papers shown
Locality-Aware Generalizable Implicit Neural Representation
Neural Information Processing Systems (NeurIPS), 2023
Doyup Lee
Chiheon Kim
Minsu Cho
Wook-Shin Han
261
19
0
09 Oct 2023
Perceptual Artifacts Localization for Image Synthesis Tasks
IEEE International Conference on Computer Vision (ICCV), 2023
Lingzhi Zhang
Zhengjie Xu
Connelly Barnes
Yuqian Zhou
Qing Liu
Chentao Song
Sohrab Amirghodsi
Zhe Lin
Eli Shechtman
Jianbo Shi
DiffM
246
39
0
09 Oct 2023
IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image Prompts
International Conference on Learning Representations (ICLR), 2023
Bo-Wen Zeng
Shanglin Li
Yutang Feng
Ling Yang
Hong Li
...
Conghui He
Wentao Zhang
Jianzhuang Liu
Baochang Zhang
Shuicheng Yan
DiffM
289
7
0
09 Oct 2023
The Emergence of Reproducibility and Generalizability in Diffusion Models
Huijie Zhang
Jinfan Zhou
Yifu Lu
Minzhe Guo
Peng Wang
Liyue Shen
Qing Qu
DiffM
302
15
0
08 Oct 2023
Assessing Robustness via Score-Based Adversarial Image Generation
Marcel Kollovieh
Lukas Gosch
Yan Scholten
Marten Lienen
Leo Schwinn
Stephan Günnemann
DiffM
550
6
0
06 Oct 2023
Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Anton Razzhigaev
Arseniy Shakhmatov
Anastasia Maltseva
V.Ya. Arkhipkin
Igor Pavlov
Ilya Ryabov
Angelina Kuts
Sergey Petrakov
Andrey Kuznetsov
Denis Dimitrov
334
118
0
05 Oct 2023
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model
International Conference on Learning Representations (ICLR), 2023
Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Yan Zheng
Yujing Hu
Tangjie Lv
Changjie Fan
Zhipeng Hu
255
40
0
03 Oct 2023
PixArt-
α
α
α
: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
International Conference on Learning Representations (ICLR), 2023
Junsong Chen
Jincheng Yu
Chongjian Ge
Lewei Yao
Enze Xie
...
Zhongdao Wang
James T. Kwok
Ping Luo
Huchuan Lu
Zhenguo Li
DiffM
600
680
0
30 Sep 2023
GAIA-1: A Generative World Model for Autonomous Driving
Masane Fuchi
Lloyd Russell
Hudson Yeo
Zak Murez
Hiroto Minami
Alex Kendall
Tomohiro Takagi
Gianluca Corrado
VGen
399
422
0
29 Sep 2023
AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive Computation
European Conference on Computer Vision (ECCV), 2023
Shengkun Tang
Yaqing Wang
Maksim Dzhigil
Yi Liang
Yongbin Li
Dongkuan Xu
253
9
0
29 Sep 2023
Denoising Diffusion Bridge Models
International Conference on Learning Representations (ICLR), 2023
Linqi Zhou
Aaron Lou
Samar Khanna
Stefano Ermon
DiffM
422
129
0
29 Sep 2023
Text-to-3D using Gaussian Splatting
Computer Vision and Pattern Recognition (CVPR), 2023
Manish Sharma
Moitreya Chatterjee
Yikai Wang
Huaping Liu
3DGS
494
330
0
28 Sep 2023
Dream the Impossible: Outlier Imagination with Diffusion Models
Neural Information Processing Systems (NeurIPS), 2023
Xuefeng Du
Yiyou Sun
Xiaojin Zhu
Shouqing Yang
332
87
0
23 Sep 2023
GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER
Neural Information Processing Systems (NeurIPS), 2023
Mingzhen Sun
Weining Wang
Zihan Qin
Jiahui Sun
Si-Qing Chen
Qingbin Liu
DiffM
160
5
0
23 Sep 2023
DreamLLM: Synergistic Multimodal Comprehension and Creation
International Conference on Learning Representations (ICLR), 2023
Runpei Dong
Chunrui Han
Yuang Peng
Zekun Qi
Zheng Ge
...
Hao-Ran Wei
Xiangwen Kong
Xiangyu Zhang
Kaisheng Ma
Li Yi
MLLM
299
275
0
20 Sep 2023
Cartoondiff: Training-free Cartoon Image Generation with Diffusion Transformer Models
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Feihong He
Gang Li
Hui Xiong
Leilei Yan
Shimeng Hou
Hongwei Dong
Fanzhang Li
DiffM
236
9
0
15 Sep 2023
Large-Vocabulary 3D Diffusion Model with Transformer
International Conference on Learning Representations (ICLR), 2023
Ziang Cao
Fangzhou Hong
Tong Wu
Liang Pan
Ziwei Liu
DiffM
295
50
0
14 Sep 2023
SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models
Neural Information Processing Systems (NeurIPS), 2023
Shuchen Xue
Mingyang Yi
Weijian Luo
Shifeng Zhang
Jiacheng Sun
Hao Sun
Zhi-Ming Ma
DiffM
540
67
0
10 Sep 2023
MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask
International Journal of Computer Vision (IJCV), 2023
Yupeng Zhou
Daquan Zhou
Zuo-Liang Zhu
Yaxing Wang
Qibin Hou
Jiashi Feng
174
13
0
08 Sep 2023
Relay Diffusion: Unifying diffusion process across resolutions for image synthesis
International Conference on Learning Representations (ICLR), 2023
Jiayan Teng
Wendi Zheng
Ming Ding
Wenyi Hong
Jianqiao Wangni
Zhuoyi Yang
Jie Tang
DiffM
234
71
0
04 Sep 2023
VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation
Xin Li
Wenqing Chu
Ye Wu
Weihang Yuan
Fanglong Liu
Tao Gui
Fu Li
Haocheng Feng
Errui Ding
Jingdong Wang
VGen
332
70
0
01 Sep 2023
Elucidating the Exposure Bias in Diffusion Models
International Conference on Learning Representations (ICLR), 2023
Mang Ning
Mingxiao Li
Jianlin Su
A. A. Salah
Itir Onal Ertugrul
DiffM
520
73
0
29 Aug 2023
Towards Large-scale 3D Representation Learning with Multi-dataset Point Prompt Training
Computer Vision and Pattern Recognition (CVPR), 2023
Xiaoyang Wu
Zhuotao Tian
Xin Wen
Bohao Peng
Xihui Liu
Kaicheng Yu
Hengshuang Zhao
206
77
0
18 Aug 2023
Learning A Coarse-to-Fine Diffusion Transformer for Image Restoration
Liyan Wang
Qinyu Yang
Cong Wang
Wen Wang
Jin-shan Pan
Zhixun Su
DiffM
226
6
0
17 Aug 2023
Accelerating Diffusion-based Combinatorial Optimization Solvers by Progressive Distillation
Junwei Huang
Zhiqing Sun
Yiming Yang
DiffM
124
6
0
12 Aug 2023
Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model
Fan Zhang
Naye Ji
Fuxing Gao
Siyuan Zhao
Zhaohan Wang
Shunman Li
249
0
0
11 Aug 2023
The Paradigm Shifts in Artificial Intelligence
Communications of the ACM (CACM), 2023
V. Dhar
AI4TS
AI4CE
152
8
0
02 Aug 2023
Memory Encoding Model
Huzheng Yang
James C. Gee
Jianbo Shi
135
7
0
02 Aug 2023
Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry
Neural Information Processing Systems (NeurIPS), 2023
Yong-Hyun Park
Mingi Kwon
J. Choi
Junghyo Jo
Youngjung Uh
DiffM
447
109
0
24 Jul 2023
Diffusion Sampling with Momentum for Mitigating Divergence Artifacts
International Conference on Learning Representations (ICLR), 2023
Suttisak Wizadwongsa
Worameth Chinchuthakun
Pramook Khungurn
Amit Raj
Supasorn Suwajanakorn
DiffM
291
2
0
20 Jul 2023
BSDM: Background Suppression Diffusion Model for Hyperspectral Anomaly Detection
Haonan Qin
Weiying Xie
Yunsong Li
Leyuan Fang
DiffM
111
17
0
19 Jul 2023
Flow Matching in Latent Space
Quan Dao
Hao Phung
Binh Duc Nguyen
Anh Tran
362
110
0
17 Jul 2023
Complexity Matters: Rethinking the Latent Space for Generative Modeling
Neural Information Processing Systems (NeurIPS), 2023
Tianyang Hu
Fei Chen
Hong Wang
Jiawei Li
Wei Cao
Jiacheng Sun
Hao Sun
DiffM
320
17
0
17 Jul 2023
DreamTeacher: Pretraining Image Backbones with Deep Generative Models
IEEE International Conference on Computer Vision (ICCV), 2023
Daiqing Li
Huan Ling
Amlan Kar
David Acuna
Seung Wook Kim
Karsten Kreis
Antonio Torralba
Sanja Fidler
VLM
DiffM
266
34
0
14 Jul 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
International Conference on Learning Representations (ICLR), 2023
Yuwei Guo
Ceyuan Yang
Anyi Rao
Zhengyang Liang
Yaohui Wang
Yu Qiao
Maneesh Agrawala
Dahua Lin
Bo Dai
VGen
946
1,296
0
10 Jul 2023
Detecting Images Generated by Deep Diffusion Models using their Local Intrinsic Dimensionality
P. Lorenz
Ricard Durall
J. Keuper
DiffM
615
52
0
05 Jul 2023
SVDM: Single-View Diffusion Model for Pseudo-Stereo 3D Object Detection
Yuguang Shi
DiffM
258
2
0
05 Jul 2023
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
International Conference on Learning Representations (ICLR), 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
1.7K
3,891
0
04 Jul 2023
DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation
Neural Information Processing Systems (NeurIPS), 2023
Shentong Mo
Enze Xie
Ruihang Chu
Lewei Yao
Lanqing Hong
Matthias Nießner
Zhenguo Li
194
110
0
04 Jul 2023
Spiking Denoising Diffusion Probabilistic Models
IEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Jiahang Cao
Ziqing Wang
Hanzhong Guo
Haotai Cheng
Qiang Zhang
Renjing Xu
DiffM
307
22
0
29 Jun 2023
Federated Generative Learning with Foundation Models
Jie Zhang
Xiaohua Qi
Bo Zhao
FedML
295
28
0
28 Jun 2023
Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision
Neural Information Processing Systems (NeurIPS), 2023
A. Tewari
Tianwei Yin
George Cazenavette
Semon Rezchikov
J. Tenenbaum
F. Durand
William T. Freeman
Vincent Sitzmann
DiffM
411
113
0
20 Jun 2023
EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model
Li-Ping Yin
Yijun Wang
Tianyu He
Jinming Liu
Wei Zhao
Bohan Li
Xin Jin
Jianxin Lin
DiffM
187
21
0
20 Jun 2023
Masked Diffusion Models Are Fast Distribution Learners
Jiachen Lei
Qinglong Wang
Pengyu Cheng
Zhongjie Ba
Zhan Qin
Peng Kuang
Zhenguang Liu
Kui Ren
DiffM
457
4
0
20 Jun 2023
ArtFusion: Controllable Arbitrary Style Transfer using Dual Conditional Latent Diffusion Models
Da Chen
DiffM
224
6
0
15 Jun 2023
Fast Training of Diffusion Models with Masked Transformers
Hongkai Zheng
Weili Nie
Arash Vahdat
Anima Anandkumar
DiffM
324
132
0
15 Jun 2023
Conditional Human Sketch Synthesis with Explicit Abstraction Control
Da Chen
DiffM
149
1
0
15 Jun 2023
DiffAug: A Diffuse-and-Denoise Augmentation for Training Robust Classifiers
Neural Information Processing Systems (NeurIPS), 2023
Chandramouli Shama Sastry
Sri Harsha Dumpala
Sageev Oore
279
3
0
15 Jun 2023
Extraction and Recovery of Spatio-Temporal Structure in Latent Dynamics Alignment with Diffusion Models
Neural Information Processing Systems (NeurIPS), 2023
Yule Wang
Zijing Wu
Chengrui Li
Anqi Wu
DiffM
325
15
0
09 Jun 2023
BOOT: Data-free Distillation of Denoising Diffusion Models with Bootstrapping
Jiatao Gu
Shuangfei Zhai
Yizhe Zhang
Lingjie Liu
J. Susskind
DiffM
216
94
0
08 Jun 2023
Previous
1
2
3
...
52
53
54
55
Next
Page 53 of 55
Page
of 55
Go