ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2212.09748
  4. Cited By
Scalable Diffusion Models with Transformers
v1v2 (latest)

Scalable Diffusion Models with Transformers

IEEE International Conference on Computer Vision (ICCV), 2022
19 December 2022
William S. Peebles
Saining Xie
    GNN
ArXiv (abs)PDFHTMLHuggingFace (18 upvotes)

Papers citing "Scalable Diffusion Models with Transformers"

50 / 2,711 papers shown
On the Design Fundamentals of Diffusion Models: A Survey
On the Design Fundamentals of Diffusion Models: A SurveyPattern Recognition (Pattern Recogn.), 2023
Ziyi Chang
George Alex Koulieris
Hyung Jin Chang
Hubert P. H. Shum
DiffM
624
79
0
07 Jun 2023
PLANNER: Generating Diversified Paragraph via Latent Language Diffusion
  Model
PLANNER: Generating Diversified Paragraph via Latent Language Diffusion ModelNeural Information Processing Systems (NeurIPS), 2023
Yizhe Zhang
Jiatao Gu
Zhuofeng Wu
Shuangfei Zhai
J. Susskind
Navdeep Jaitly
DiffM
389
44
0
05 Jun 2023
SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two
  Seconds
SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two SecondsNeural Information Processing Systems (NeurIPS), 2023
Yanyu Li
Huan Wang
Qing Jin
Ju Hu
Pavlo Chemerys
Yun Fu
Yanzhi Wang
Sergey Tulyakov
Jian Ren
VLM
339
234
0
01 Jun 2023
Coneheads: Hierarchy Aware Attention
Coneheads: Hierarchy Aware AttentionNeural Information Processing Systems (NeurIPS), 2023
Albert Tseng
Tao Yu
Toni J.B. Liu
Chris De Sa
3DPC
264
7
0
01 Jun 2023
Addressing Negative Transfer in Diffusion Models
Addressing Negative Transfer in Diffusion ModelsNeural Information Processing Systems (NeurIPS), 2023
Hyojun Go
Jinyoung Kim
Yunsung Lee
Seunghyun Lee
Shinhyeok Oh
Hyeongdon Moon
Seungtaek Choi
DiffMVLM
545
31
0
01 Jun 2023
Humans in 4D: Reconstructing and Tracking Humans with Transformers
Humans in 4D: Reconstructing and Tracking Humans with TransformersIEEE International Conference on Computer Vision (ICCV), 2023
Shubham Goel
Georgios Pavlakos
Jathushan Rajasegaran
Angjoo Kanazawa
Jitendra Malik
3DH
387
309
0
31 May 2023
A Unified Framework for U-Net Design and Analysis
A Unified Framework for U-Net Design and AnalysisNeural Information Processing Systems (NeurIPS), 2023
Christopher Williams
Fabian Falck
George Deligiannidis
Chris Holmes
Arnaud Doucet
Saifuddin Syed
SSegAI4CE
234
62
0
31 May 2023
Nested Diffusion Processes for Anytime Image Generation
Nested Diffusion Processes for Anytime Image GenerationIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2023
Noam Elata
Bahjat Kawar
T. Michaeli
Michael Elad
DiffM
251
6
0
30 May 2023
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation
Jia-Bin Huang
Yi Ren
Rongjie Huang
Dongchao Yang
Zhenhui Ye
Chen Zhang
Jinglin Liu
Xiang Yin
Zejun Ma
Zhou Zhao
DiffM
205
98
0
29 May 2023
Diffusion Model is an Effective Planner and Data Synthesizer for
  Multi-Task Reinforcement Learning
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2023
Haoran He
Chenjia Bai
Kang Xu
Zhuoran Yang
Weinan Zhang
Dong Wang
Bingyan Zhao
Xuelong Li
DiffMOffRL
352
135
0
29 May 2023
UDPM: Upsampling Diffusion Probabilistic Models
UDPM: Upsampling Diffusion Probabilistic ModelsNeural Information Processing Systems (NeurIPS), 2023
Shady Abu Hussein
Raja Giryes
DiffM
394
4
0
25 May 2023
Knowledge Diffusion for Distillation
Knowledge Diffusion for DistillationNeural Information Processing Systems (NeurIPS), 2023
Tao Huang
Yuan Zhang
Mingkai Zheng
Shan You
Fei Wang
Chao Qian
Chang Xu
323
87
0
25 May 2023
T1: Scaling Diffusion Probabilistic Fields to High-Resolution on Unified
  Visual Modalities
T1: Scaling Diffusion Probabilistic Fields to High-Resolution on Unified Visual Modalities
Kangfu Mei
Mo Zhou
Vishal M. Patel
DiffM
331
1
0
24 May 2023
VDT: General-purpose Video Diffusion Transformers via Mask Modeling
VDT: General-purpose Video Diffusion Transformers via Mask ModelingInternational Conference on Learning Representations (ICLR), 2023
Haoyu Lu
Guoxing Yang
Nanyi Fei
Yuqi Huo
Zhiwu Lu
Ping Luo
Mingyu Ding
DiffMVGen
223
99
0
22 May 2023
U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech
U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech
Xin Jing
Yi Chang
Zijiang Yang
Jiang-jian Xie
Andreas Triantafyllopoulos
Bjoern W. Schuller
206
11
0
22 May 2023
Is Synthetic Data From Diffusion Models Ready for Knowledge
  Distillation?
Is Synthetic Data From Diffusion Models Ready for Knowledge Distillation?
Zheng Li
Yuxuan Li
Penghai Zhao
Renjie Song
Xiang Li
Jian Yang
197
24
0
22 May 2023
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion TransformerConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Huadai Liu
Rongjie Huang
Xuan Lin
Wenqiang Xu
Maozong Zheng
Hong Chen
Jinzheng He
Zhou Zhao
DiffM
333
31
0
22 May 2023
Guided Motion Diffusion for Controllable Human Motion Synthesis
Guided Motion Diffusion for Controllable Human Motion SynthesisIEEE International Conference on Computer Vision (ICCV), 2023
Korrawe Karunratanakul
Konpat Preechakul
Supasorn Suwajanakorn
Siyu Tang
DiffM
425
204
0
21 May 2023
Learning Joint 2D & 3D Diffusion Models for Complete Molecule Generation
Learning Joint 2D & 3D Diffusion Models for Complete Molecule Generation
Han Huang
Leilei Sun
Bowen Du
Weifeng Lv
DiffM
286
22
0
21 May 2023
LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis
LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis
Yu Xie
Rui Li
Kaidong Zhang
Xin Luo
Dong Liu
DiffM
464
5
0
19 May 2023
Controllable Mind Visual Diffusion Model
Controllable Mind Visual Diffusion ModelAAAI Conference on Artificial Intelligence (AAAI), 2023
Bo-Wen Zeng
Shanglin Li
Xuhui Liu
Sicheng Gao
Xiaolong Jiang
Xu Tang
Feng-Long Xie
Jianzhuang Liu
Baochang Zhang
DiffM
223
37
0
17 May 2023
Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed
  Opportunity
Parameter-Efficient Fine-Tuning for Medical Image Analysis: The Missed OpportunityInternational Conference on Medical Imaging with Deep Learning (MIDL), 2023
Raman Dutt
Linus Ericsson
Pedro Sanchez
Sotirios A. Tsaftaris
Timothy M. Hospedales
MedIm
482
73
0
14 May 2023
Visual Tuning
Visual TuningACM Computing Surveys (ACM Comput. Surv.), 2023
Bruce X. B. Yu
Jianlong Chang
Haixin Wang
Lin Liu
Shijie Wang
...
Lingxi Xie
Haojie Li
Zhouchen Lin
Qi Tian
Chang Wen Chen
VLM
438
59
0
10 May 2023
BoDiffusion: Diffusing Sparse Observations for Full-Body Human Motion
  Synthesis
BoDiffusion: Diffusing Sparse Observations for Full-Body Human Motion Synthesis
Angela Castillo
María Escobar
Guillaume Jeanneret
Albert Pumarola
Pablo Arbelaez
Ali K. Thabet
A. Sanakoyeu
DiffMVGen
217
39
0
21 Apr 2023
Refusion: Enabling Large-Size Realistic Image Restoration with
  Latent-Space Diffusion Models
Refusion: Enabling Large-Size Realistic Image Restoration with Latent-Space Diffusion Models
Ziwei Luo
Fredrik K. Gustafsson
Zhengli Zhao
Jens Sjölund
Thomas B. Schon
188
160
0
17 Apr 2023
Control3Diff: Learning Controllable 3D Diffusion Models from Single-view
  Images
Control3Diff: Learning Controllable 3D Diffusion Models from Single-view ImagesInternational Conference on 3D Vision (3DV), 2023
Jiatao Gu
Qingzhe Gao
Shuangfei Zhai
Baoquan Chen
Lingjie Liu
J. Susskind
267
35
0
13 Apr 2023
DiffFit: Unlocking Transferability of Large Diffusion Models via Simple
  Parameter-Efficient Fine-Tuning
DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-TuningIEEE International Conference on Computer Vision (ICCV), 2023
Enze Xie
Lewei Yao
Han Shi
Zhili Liu
Daquan Zhou
Zhaoqiang Liu
Jiawei Li
Zhenguo Li
615
91
0
13 Apr 2023
Intriguing properties of synthetic images: from generative adversarial
  networks to diffusion models
Intriguing properties of synthetic images: from generative adversarial networks to diffusion models
Riccardo Corvi
D. Cozzolino
Giovanni Poggi
Koki Nagano
L. Verdoliva
DiffM
313
142
0
13 Apr 2023
Revisiting the Evaluation of Image Synthesis with GANs
Revisiting the Evaluation of Image Synthesis with GANsNeural Information Processing Systems (NeurIPS), 2023
Mengping Yang
Ceyuan Yang
Yichi Zhang
Qingyan Bai
Yujun Shen
Bo Dai
EGVM
278
10
0
04 Apr 2023
Your Diffusion Model is Secretly a Zero-Shot Classifier
Your Diffusion Model is Secretly a Zero-Shot ClassifierIEEE International Conference on Computer Vision (ICCV), 2023
Alexander C. Li
Mihir Prabhudesai
Shivam Duggal
Ellis L Brown
Deepak Pathak
DiffMVLM
686
309
0
28 Mar 2023
The Stable Signature: Rooting Watermarks in Latent Diffusion Models
The Stable Signature: Rooting Watermarks in Latent Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Pierre Fernandez
Guillaume Couairon
Edouard Grave
Matthijs Douze
Teddy Furon
WIGM
327
298
0
27 Mar 2023
PDPP: Projected Diffusion for Procedure Planning in Instructional Videos
PDPP: Projected Diffusion for Procedure Planning in Instructional VideosComputer Vision and Pattern Recognition (CVPR), 2023
Hanlin Wang
Yilu Wu
Sheng Guo
Limin Wang
VGenDiffM
423
34
0
26 Mar 2023
MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer
MDTv2: Masked Diffusion Transformer is a Strong Image SynthesizerIEEE International Conference on Computer Vision (ICCV), 2023
Shanghua Gao
Pan Zhou
Mingg-Ming Cheng
Shuicheng Yan
DiffM
1.1K
248
0
25 Mar 2023
CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion
CompoDiff: Versatile Composed Image Retrieval With Latent Diffusion
Geonmo Gu
Sanghyuk Chun
Wonjae Kim
HeeJae Jun
Yoohoon Kang
Sangdoo Yun
DiffM
550
77
0
21 Mar 2023
Polynomial Implicit Neural Representations For Large Diverse Datasets
Polynomial Implicit Neural Representations For Large Diverse DatasetsComputer Vision and Pattern Recognition (CVPR), 2023
Rajhans Singh
Ankita Shukla
Pavan Turaga
AI4CE
202
28
0
20 Mar 2023
SVDiff: Compact Parameter Space for Diffusion Fine-Tuning
SVDiff: Compact Parameter Space for Diffusion Fine-TuningIEEE International Conference on Computer Vision (ICCV), 2023
Ligong Han
Yinxiao Li
Han Zhang
P. Milanfar
Dimitris N. Metaxas
Feng Yang
DiffM
668
367
0
20 Mar 2023
Denoising Diffusion Autoencoders are Unified Self-supervised Learners
Denoising Diffusion Autoencoders are Unified Self-supervised LearnersIEEE International Conference on Computer Vision (ICCV), 2023
Weilai Xiang
Hongyu Yang
Di Huang
Yunhong Wang
DiffM
465
119
0
17 Mar 2023
Efficient Diffusion Training via Min-SNR Weighting Strategy
Efficient Diffusion Training via Min-SNR Weighting StrategyIEEE International Conference on Computer Vision (ICCV), 2023
Tiankai Hang
Shuyang Gu
Chen Li
Jianmin Bao
Dong Chen
Han Hu
Xin Geng
B. Guo
305
220
0
16 Mar 2023
ResDiff: Combining CNN and Diffusion Model for Image Super-Resolution
ResDiff: Combining CNN and Diffusion Model for Image Super-ResolutionAAAI Conference on Artificial Intelligence (AAAI), 2023
Shuyao Shang
Zhengyang Shan
Guangxing Liu
LunQian Wang
XingHua Wang
Zekai Zhang
Jingling Zhang
DiffM
277
136
0
15 Mar 2023
Editing Implicit Assumptions in Text-to-Image Diffusion Models
Editing Implicit Assumptions in Text-to-Image Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Hadas Orgad
Bahjat Kawar
Yonatan Belinkov
DiffM
363
115
0
14 Mar 2023
Scaling up GANs for Text-to-Image Synthesis
Scaling up GANs for Text-to-Image SynthesisComputer Vision and Pattern Recognition (CVPR), 2023
Minguk Kang
Jun-Yan Zhu
Richard Y. Zhang
Jaesik Park
Eli Shechtman
Sylvain Paris
Taesung Park
325
597
0
09 Mar 2023
TRACT: Denoising Diffusion Models with Transitive Closure
  Time-Distillation
TRACT: Denoising Diffusion Models with Transitive Closure Time-Distillation
David Berthelot
Arnaud Autef
Jierui Lin
Dian Ang Yap
Shuangfei Zhai
Siyuan Hu
Daniel Zheng
Walter Talbot
Eric Gu
DiffM
249
119
0
07 Mar 2023
DLT: Conditioned layout generation with Joint Discrete-Continuous
  Diffusion Layout Transformer
DLT: Conditioned layout generation with Joint Discrete-Continuous Diffusion Layout TransformerIEEE International Conference on Computer Vision (ICCV), 2023
Elad Levi
Eli Brosh
Mykola Mykhailych
Meir Perez
DiffM
201
27
0
07 Mar 2023
Understanding Diffusion Objectives as the ELBO with Simple Data
  Augmentation
Understanding Diffusion Objectives as the ELBO with Simple Data AugmentationNeural Information Processing Systems (NeurIPS), 2023
Diederik P. Kingma
Ruiqi Gao
DiffM
744
238
0
01 Mar 2023
Unlimited-Size Diffusion Restoration
Unlimited-Size Diffusion Restoration
Yinhuai Wang
Jiwen Yu
Runyi Yu
Jian Zhang
192
16
0
01 Mar 2023
Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few
  Labels
Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few LabelsNeural Information Processing Systems (NeurIPS), 2023
Zebin You
Yong Zhong
Fan Bao
Jiacheng Sun
Chongxuan Li
Jun Zhu
DiffMVLM
528
50
0
21 Feb 2023
A Reparameterized Discrete Diffusion Model for Text Generation
A Reparameterized Discrete Diffusion Model for Text Generation
Lin Zheng
Jianbo Yuan
Lei Yu
Lingpeng Kong
DiffM
285
114
0
11 Feb 2023
Q-Diffusion: Quantizing Diffusion Models
Q-Diffusion: Quantizing Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Xiuyu Li
Yijia Liu
Long Lian
Hua Yang
Zhen Dong
Daniel Kang
Shanghang Zhang
Kurt Keutzer
DiffMMQ
374
237
0
08 Feb 2023
Structure and Content-Guided Video Synthesis with Diffusion Models
Structure and Content-Guided Video Synthesis with Diffusion ModelsIEEE International Conference on Computer Vision (ICCV), 2023
Patrick Esser
Johnathan Chiu
Parmida Atighehchian
Jonathan Granskog
Anastasis Germanidis
DiffMVGen
379
663
0
06 Feb 2023
Diffusion Models as Artists: Are we Closing the Gap between Humans and
  Machines?
Diffusion Models as Artists: Are we Closing the Gap between Humans and Machines?International Conference on Machine Learning (ICML), 2023
Victor Boutin
Thomas Fel
Lakshya Singhal
Rishav Mukherji
Akash Nagaraj
Julien Colin
Thomas Serre
DiffM
274
10
0
27 Jan 2023
Previous
123...535455
Next