ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.09748
  4. Cited By
Scalable Diffusion Models with Transformers
v1v2 (latest)

Scalable Diffusion Models with Transformers

IEEE International Conference on Computer Vision (ICCV), 2022
19 December 2022
William S. Peebles
Saining Xie
    GNN
ArXiv (abs)PDFHTMLHuggingFace (18 upvotes)

Papers citing "Scalable Diffusion Models with Transformers"

50 / 2,712 papers shown
Locality-Aware Generalizable Implicit Neural Representation
Locality-Aware Generalizable Implicit Neural RepresentationNeural Information Processing Systems (NeurIPS), 2023
Doyup Lee
Chiheon Kim
Minsu Cho
Wook-Shin Han
261
19
0
09 Oct 2023
Perceptual Artifacts Localization for Image Synthesis Tasks
Perceptual Artifacts Localization for Image Synthesis TasksIEEE International Conference on Computer Vision (ICCV), 2023
Lingzhi Zhang
Zhengjie Xu
Connelly Barnes
Yuqian Zhou
Qing Liu
Chentao Song
Sohrab Amirghodsi
Zhe Lin
Eli Shechtman
Jianbo Shi
DiffM
246
39
0
09 Oct 2023
IPDreamer: Appearance-Controllable 3D Object Generation with Complex
  Image Prompts
IPDreamer: Appearance-Controllable 3D Object Generation with Complex Image PromptsInternational Conference on Learning Representations (ICLR), 2023
Bo-Wen Zeng
Shanglin Li
Yutang Feng
Ling Yang
Hong Li
...
Conghui He
Wentao Zhang
Jianzhuang Liu
Baochang Zhang
Shuicheng Yan
DiffM
289
7
0
09 Oct 2023
The Emergence of Reproducibility and Generalizability in Diffusion
  Models
The Emergence of Reproducibility and Generalizability in Diffusion Models
Huijie Zhang
Jinfan Zhou
Yifu Lu
Minzhe Guo
Peng Wang
Liyue Shen
Qing Qu
DiffM
302
15
0
08 Oct 2023
Assessing Robustness via Score-Based Adversarial Image Generation
Assessing Robustness via Score-Based Adversarial Image Generation
Marcel Kollovieh
Lukas Gosch
Yan Scholten
Marten Lienen
Leo Schwinn
Stephan Günnemann
DiffM
550
6
0
06 Oct 2023
Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and
  Latent Diffusion
Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent DiffusionConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Anton Razzhigaev
Arseniy Shakhmatov
Anastasia Maltseva
V.Ya. Arkhipkin
Igor Pavlov
Ilya Ryabov
Angelina Kuts
Sergey Petrakov
Andrey Kuznetsov
Denis Dimitrov
334
118
0
05 Oct 2023
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable
  Diffusion Model
AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion ModelInternational Conference on Learning Representations (ICLR), 2023
Zibin Dong
Yifu Yuan
Jianye Hao
Fei Ni
Yao Mu
Yan Zheng
Yujing Hu
Tangjie Lv
Changjie Fan
Zhipeng Hu
255
40
0
03 Oct 2023
PixArt-$α$: Fast Training of Diffusion Transformer for
  Photorealistic Text-to-Image Synthesis
PixArt-ααα: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisInternational Conference on Learning Representations (ICLR), 2023
Junsong Chen
Jincheng Yu
Chongjian Ge
Lewei Yao
Enze Xie
...
Zhongdao Wang
James T. Kwok
Ping Luo
Huchuan Lu
Zhenguo Li
DiffM
600
680
0
30 Sep 2023
GAIA-1: A Generative World Model for Autonomous Driving
GAIA-1: A Generative World Model for Autonomous Driving
Masane Fuchi
Lloyd Russell
Hudson Yeo
Zak Murez
Hiroto Minami
Alex Kendall
Tomohiro Takagi
Gianluca Corrado
VGen
399
422
0
29 Sep 2023
AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive
  Computation
AdaDiff: Accelerating Diffusion Models through Step-Wise Adaptive ComputationEuropean Conference on Computer Vision (ECCV), 2023
Shengkun Tang
Yaqing Wang
Maksim Dzhigil
Yi Liang
Yongbin Li
Dongkuan Xu
253
9
0
29 Sep 2023
Denoising Diffusion Bridge Models
Denoising Diffusion Bridge ModelsInternational Conference on Learning Representations (ICLR), 2023
Linqi Zhou
Aaron Lou
Samar Khanna
Stefano Ermon
DiffM
422
129
0
29 Sep 2023
Text-to-3D using Gaussian Splatting
Text-to-3D using Gaussian SplattingComputer Vision and Pattern Recognition (CVPR), 2023
Manish Sharma
Moitreya Chatterjee
Yikai Wang
Huaping Liu
3DGS
494
330
0
28 Sep 2023
Dream the Impossible: Outlier Imagination with Diffusion Models
Dream the Impossible: Outlier Imagination with Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2023
Xuefeng Du
Yiyou Sun
Xiaojin Zhu
Shouqing Yang
332
87
0
23 Sep 2023
GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided
  Video DecodER
GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodERNeural Information Processing Systems (NeurIPS), 2023
Mingzhen Sun
Weining Wang
Zihan Qin
Jiahui Sun
Si-Qing Chen
Qingbin Liu
DiffM
160
5
0
23 Sep 2023
DreamLLM: Synergistic Multimodal Comprehension and Creation
DreamLLM: Synergistic Multimodal Comprehension and CreationInternational Conference on Learning Representations (ICLR), 2023
Runpei Dong
Chunrui Han
Yuang Peng
Zekun Qi
Zheng Ge
...
Hao-Ran Wei
Xiangwen Kong
Xiangyu Zhang
Kaisheng Ma
Li Yi
MLLM
299
275
0
20 Sep 2023
Cartoondiff: Training-free Cartoon Image Generation with Diffusion
  Transformer Models
Cartoondiff: Training-free Cartoon Image Generation with Diffusion Transformer ModelsIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023
Feihong He
Gang Li
Hui Xiong
Leilei Yan
Shimeng Hou
Hongwei Dong
Fanzhang Li
DiffM
236
9
0
15 Sep 2023
Large-Vocabulary 3D Diffusion Model with Transformer
Large-Vocabulary 3D Diffusion Model with TransformerInternational Conference on Learning Representations (ICLR), 2023
Ziang Cao
Fangzhou Hong
Tong Wu
Liang Pan
Ziwei Liu
DiffM
295
50
0
14 Sep 2023
SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models
SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2023
Shuchen Xue
Mingyang Yi
Weijian Luo
Shifeng Zhang
Jiacheng Sun
Hao Sun
Zhi-Ming Ma
DiffM
540
67
0
10 Sep 2023
MaskDiffusion: Boosting Text-to-Image Consistency with Conditional Mask
MaskDiffusion: Boosting Text-to-Image Consistency with Conditional MaskInternational Journal of Computer Vision (IJCV), 2023
Yupeng Zhou
Daquan Zhou
Zuo-Liang Zhu
Yaxing Wang
Qibin Hou
Jiashi Feng
174
13
0
08 Sep 2023
Relay Diffusion: Unifying diffusion process across resolutions for image
  synthesis
Relay Diffusion: Unifying diffusion process across resolutions for image synthesisInternational Conference on Learning Representations (ICLR), 2023
Jiayan Teng
Wendi Zheng
Ming Ding
Wenyi Hong
Jianqiao Wangni
Zhuoyi Yang
Jie Tang
DiffM
234
71
0
04 Sep 2023
VideoGen: A Reference-Guided Latent Diffusion Approach for High
  Definition Text-to-Video Generation
VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation
Xin Li
Wenqing Chu
Ye Wu
Weihang Yuan
Fanglong Liu
Tao Gui
Fu Li
Haocheng Feng
Errui Ding
Jingdong Wang
VGen
332
70
0
01 Sep 2023
Elucidating the Exposure Bias in Diffusion Models
Elucidating the Exposure Bias in Diffusion ModelsInternational Conference on Learning Representations (ICLR), 2023
Mang Ning
Mingxiao Li
Jianlin Su
A. A. Salah
Itir Onal Ertugrul
DiffM
520
73
0
29 Aug 2023
Towards Large-scale 3D Representation Learning with Multi-dataset Point
  Prompt Training
Towards Large-scale 3D Representation Learning with Multi-dataset Point Prompt TrainingComputer Vision and Pattern Recognition (CVPR), 2023
Xiaoyang Wu
Zhuotao Tian
Xin Wen
Bohao Peng
Xihui Liu
Kaicheng Yu
Hengshuang Zhao
206
77
0
18 Aug 2023
Learning A Coarse-to-Fine Diffusion Transformer for Image Restoration
Learning A Coarse-to-Fine Diffusion Transformer for Image Restoration
Liyan Wang
Qinyu Yang
Cong Wang
Wen Wang
Jin-shan Pan
Zhixun Su
DiffM
226
6
0
17 Aug 2023
Accelerating Diffusion-based Combinatorial Optimization Solvers by
  Progressive Distillation
Accelerating Diffusion-based Combinatorial Optimization Solvers by Progressive Distillation
Junwei Huang
Zhiqing Sun
Yiming Yang
DiffM
124
6
0
12 Aug 2023
Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model
Fan Zhang
Naye Ji
Fuxing Gao
Siyuan Zhao
Zhaohan Wang
Shunman Li
249
0
0
11 Aug 2023
The Paradigm Shifts in Artificial Intelligence
The Paradigm Shifts in Artificial IntelligenceCommunications of the ACM (CACM), 2023
V. Dhar
AI4TSAI4CE
152
8
0
02 Aug 2023
Memory Encoding Model
Memory Encoding Model
Huzheng Yang
James C. Gee
Jianbo Shi
135
7
0
02 Aug 2023
Understanding the Latent Space of Diffusion Models through the Lens of
  Riemannian Geometry
Understanding the Latent Space of Diffusion Models through the Lens of Riemannian GeometryNeural Information Processing Systems (NeurIPS), 2023
Yong-Hyun Park
Mingi Kwon
J. Choi
Junghyo Jo
Youngjung Uh
DiffM
447
109
0
24 Jul 2023
Diffusion Sampling with Momentum for Mitigating Divergence Artifacts
Diffusion Sampling with Momentum for Mitigating Divergence ArtifactsInternational Conference on Learning Representations (ICLR), 2023
Suttisak Wizadwongsa
Worameth Chinchuthakun
Pramook Khungurn
Amit Raj
Supasorn Suwajanakorn
DiffM
291
2
0
20 Jul 2023
BSDM: Background Suppression Diffusion Model for Hyperspectral Anomaly
  Detection
BSDM: Background Suppression Diffusion Model for Hyperspectral Anomaly Detection
Haonan Qin
Weiying Xie
Yunsong Li
Leyuan Fang
DiffM
111
17
0
19 Jul 2023
Flow Matching in Latent Space
Flow Matching in Latent Space
Quan Dao
Hao Phung
Binh Duc Nguyen
Anh Tran
362
110
0
17 Jul 2023
Complexity Matters: Rethinking the Latent Space for Generative Modeling
Complexity Matters: Rethinking the Latent Space for Generative ModelingNeural Information Processing Systems (NeurIPS), 2023
Tianyang Hu
Fei Chen
Hong Wang
Jiawei Li
Wei Cao
Jiacheng Sun
Hao Sun
DiffM
320
17
0
17 Jul 2023
DreamTeacher: Pretraining Image Backbones with Deep Generative Models
DreamTeacher: Pretraining Image Backbones with Deep Generative ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Daiqing Li
Huan Ling
Amlan Kar
David Acuna
Seung Wook Kim
Karsten Kreis
Antonio Torralba
Sanja Fidler
VLMDiffM
266
34
0
14 Jul 2023
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models
  without Specific Tuning
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific TuningInternational Conference on Learning Representations (ICLR), 2023
Yuwei Guo
Ceyuan Yang
Anyi Rao
Zhengyang Liang
Yaohui Wang
Yu Qiao
Maneesh Agrawala
Dahua Lin
Bo Dai
VGen
946
1,296
0
10 Jul 2023
Detecting Images Generated by Deep Diffusion Models using their Local Intrinsic Dimensionality
P. Lorenz
Ricard Durall
J. Keuper
DiffM
615
52
0
05 Jul 2023
SVDM: Single-View Diffusion Model for Pseudo-Stereo 3D Object Detection
SVDM: Single-View Diffusion Model for Pseudo-Stereo 3D Object Detection
Yuguang Shi
DiffM
258
2
0
05 Jul 2023
SDXL: Improving Latent Diffusion Models for High-Resolution Image
  Synthesis
SDXL: Improving Latent Diffusion Models for High-Resolution Image SynthesisInternational Conference on Learning Representations (ICLR), 2023
Dustin Podell
Zion English
Kyle Lacey
A. Blattmann
Tim Dockhorn
Jonas Muller
Joe Penna
Robin Rombach
1.7K
3,891
0
04 Jul 2023
DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation
DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape GenerationNeural Information Processing Systems (NeurIPS), 2023
Shentong Mo
Enze Xie
Ruihang Chu
Lewei Yao
Lanqing Hong
Matthias Nießner
Zhenguo Li
194
110
0
04 Jul 2023
Spiking Denoising Diffusion Probabilistic Models
Spiking Denoising Diffusion Probabilistic ModelsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Jiahang Cao
Ziqing Wang
Hanzhong Guo
Haotai Cheng
Qiang Zhang
Renjing Xu
DiffM
307
22
0
29 Jun 2023
Federated Generative Learning with Foundation Models
Federated Generative Learning with Foundation Models
Jie Zhang
Xiaohua Qi
Bo Zhao
FedML
295
28
0
28 Jun 2023
Diffusion with Forward Models: Solving Stochastic Inverse Problems
  Without Direct Supervision
Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct SupervisionNeural Information Processing Systems (NeurIPS), 2023
A. Tewari
Tianwei Yin
George Cazenavette
Semon Rezchikov
J. Tenenbaum
F. Durand
William T. Freeman
Vincent Sitzmann
DiffM
411
113
0
20 Jun 2023
EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model
EMoG: Synthesizing Emotive Co-speech 3D Gesture with Diffusion Model
Li-Ping Yin
Yijun Wang
Tianyu He
Jinming Liu
Wei Zhao
Bohan Li
Xin Jin
Jianxin Lin
DiffM
187
21
0
20 Jun 2023
Masked Diffusion Models Are Fast Distribution Learners
Masked Diffusion Models Are Fast Distribution Learners
Jiachen Lei
Qinglong Wang
Pengyu Cheng
Zhongjie Ba
Zhan Qin
Peng Kuang
Zhenguang Liu
Kui Ren
DiffM
457
4
0
20 Jun 2023
ArtFusion: Controllable Arbitrary Style Transfer using Dual Conditional
  Latent Diffusion Models
ArtFusion: Controllable Arbitrary Style Transfer using Dual Conditional Latent Diffusion Models
Da Chen
DiffM
224
6
0
15 Jun 2023
Fast Training of Diffusion Models with Masked Transformers
Fast Training of Diffusion Models with Masked Transformers
Hongkai Zheng
Weili Nie
Arash Vahdat
Anima Anandkumar
DiffM
324
132
0
15 Jun 2023
Conditional Human Sketch Synthesis with Explicit Abstraction Control
Conditional Human Sketch Synthesis with Explicit Abstraction Control
Da Chen
DiffM
149
1
0
15 Jun 2023
DiffAug: A Diffuse-and-Denoise Augmentation for Training Robust
  Classifiers
DiffAug: A Diffuse-and-Denoise Augmentation for Training Robust ClassifiersNeural Information Processing Systems (NeurIPS), 2023
Chandramouli Shama Sastry
Sri Harsha Dumpala
Sageev Oore
279
3
0
15 Jun 2023
Extraction and Recovery of Spatio-Temporal Structure in Latent Dynamics
  Alignment with Diffusion Models
Extraction and Recovery of Spatio-Temporal Structure in Latent Dynamics Alignment with Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2023
Yule Wang
Zijing Wu
Chengrui Li
Anqi Wu
DiffM
325
15
0
09 Jun 2023
BOOT: Data-free Distillation of Denoising Diffusion Models with
  Bootstrapping
BOOT: Data-free Distillation of Denoising Diffusion Models with Bootstrapping
Jiatao Gu
Shuangfei Zhai
Yizhe Zhang
Lingjie Liu
J. Susskind
DiffM
216
94
0
08 Jun 2023
Previous
123...52535455
Next
Page 53 of 55
Pageof 55