ResearchTrend.AI
Analyzing and Improving the Training Dynamics of Diffusion Models

5 December 2023
Tero Karras
M. Aittala
J. Lehtinen
Janne Hellsten
Timo Aila
S. Laine

Papers citing "Analyzing and Improving the Training Dynamics of Diffusion Models"

50 / 123 papers shown
DDAE++: Enhancing Diffusion Models Towards Unified Generative and Discriminative Learning
Weilai Xiang
Hongyu Yang
Di Huang
Yunhong Wang
14
0
0
16 May 2025
LipDiffuser: Lip-to-Speech Generation with Conditional Diffusion Models
Danilo de Oliveira
Julius Richter
Tal Peer
Timo Gerkmann
DiffM
17
0
0
16 May 2025
Score-based diffusion nowcasting of GOES imagery
Randy J. Chase
Katherine Haynes
Lander Ver Hoef
Imme Ebert-Uphoff
DiffM
26
0
0
15 May 2025
Whitened Score Diffusion: A Structured Prior for Imaging Inverse Problems
Jeffrey Alido
Tongyu Li
Yu Sun
Lei Tian
DiffM
MedIm
19
0
0
15 May 2025
DiffLocks: Generating 3D Hair from a Single Image using Diffusion Models
Radu Alexandru Rosu
Keyu Wu
Yao Feng
Youyi Zheng
M. Black
DiffM
3DH
49
0
0
09 May 2025
Normalize Everything: A Preconditioned Magnitude-Preserving Architecture for Diffusion-Based Speech Enhancement
Julius Richter
Danilo de Oliveira
Timo Gerkmann
DiffM
55
0
0
08 May 2025
No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves
D. Jiang
Mengmeng Wang
Liuzhuozheng Li
Lei Zhang
Haoyu Wang
Wei Wei
Guang Dai
Yanning Zhang
Jingdong Wang
DiffM
51
0
0
05 May 2025
Generative diffusion model surrogates for mechanistic agent-based biological models
Tien Comlekoglu
J. Q. Toledo-Marín
Douglas W. DeSimone
Shayn M. Peirce
Geoffrey C. Fox
J. Glazier
DiffM
MedIm
48
1
0
01 May 2025
Can We Achieve Efficient Diffusion without Self-Attention? Distilling Self-Attention into Convolutions
Ziyi Dong
Chengxing Zhou
Weijian Deng
Pengxu Wei
Xiangyang Ji
Liang Lin
MQ
53
0
0
30 Apr 2025
Revisiting Diffusion Autoencoder Training for Image Reconstruction Quality
Pramook Khungurn
Sukit Seripanitkarn
Phonphrm Thawatdamrongkit
Supasorn Suwajanakorn
DiffM
77
0
0
30 Apr 2025
A Langevin sampling algorithm inspired by the Adam optimizer
B. Leimkuhler
René Lohmann
P. Whalley
79
0
0
26 Apr 2025
Entropic Time Schedulers for Generative Diffusion Models
Dejan Stancevic
Luca Ambrogioni
DiffM
OOD
51
0
0
18 Apr 2025
VideoPanda: Video Panoramic Diffusion with Multi-view Attention
Kevin Xie
Amirmojtaba Sabour
Jiahui Huang
Despoina Paschalidou
G. Klár
Umar Iqbal
Sanja Fidler
Fangyin Wei
VGen
MDE
39
0
0
15 Apr 2025
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception
Ziqi Pang
Xin Xu
Yu-Xiong Wang
DiffM
67
0
0
15 Apr 2025
Efficient Generative Model Training via Embedded Representation Warmup
Deyuan Liu
Peng Sun
Xufeng Li
Tao Lin
33
0
0
14 Apr 2025
InvFussion: Bridging Supervised and Zero-shot Diffusion for Inverse Problems
Noam Elata
Hyungjin Chung
Jong Chul Ye
T. Michaeli
Michael Elad
DiffM
37
0
0
02 Apr 2025
Domain Guidance: A Simple Transfer Approach for a Pre-trained Diffusion Model
Jincheng Zhong
Xiangcheng Zhang
J. Z. Wang
Mingsheng Long
38
1
0
02 Apr 2025
MMGen: Unified Multi-modal Image Generation and Understanding in One Go
Jiepeng Wang
Zhaoqing Wang
H. Pan
Yuan Liu
Dongdong Yu
Changhu Wang
Wenping Wang
DiffM
80
0
0
26 Mar 2025
PCM : Picard Consistency Model for Fast Parallel Sampling of Diffusion Models
Junhyuk So
Jiwoong Shin
Chaeyeon Jang
Eunhyeok Park
DiffM
48
0
0
25 Mar 2025
Uncertainty-guided Perturbation for Image Super-Resolution Diffusion Model
Leheng Zhang
Weiyi You
Kexuan Shi
Shuhang Gu
62
0
0
24 Mar 2025
Guidance Free Image Editing via Explicit Conditioning
Mehdi Noroozi
Alberto Gil C. P. Ramos
Luca Morreale
Ruchika Chavhan
Malcolm Chadwick
Abhinav Mehrotra
Sourav Bhattacharya
DiffM
56
0
0
22 Mar 2025
Scale-wise Distillation of Diffusion Models
Nikita Starodubcev
Denis Kuznedelev
Artem Babenko
Dmitry Baranchuk
DiffM
53
0
0
20 Mar 2025
Training Video Foundation Models with NVIDIA NeMo
Zeeshan Patel
Ethan He
Parth Mannan
Xiaowei Ren
Ryan Wolf
...
Rong Ou
Pallab Bhattacharya
David Page
Nima Tajbakhsh
Ashwath Aithal
VGen
43
0
0
17 Mar 2025
Probabilistic Forecasting for Dynamical Systems with Missing or Imperfect Data
Siddharth Rout
Eldad Haber
Stéphane Gaudreault
AI4TS
AI4CE
67
0
0
15 Mar 2025
Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization
Kyle Sargent
Kyle Hsu
Justin Johnson
L. Fei-Fei
Jiajun Wu
DiffM
MU
58
3
0
14 Mar 2025
AugGen: Synthetic Augmentation Can Improve Discriminative Models
Parsa Rahimi
Damien Teney
Sébastien Marcel
69
0
0
14 Mar 2025
SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation
Junsong Chen
Shuchen Xue
Yuyang Zhao
Jincheng Yu
Sayak Paul
Junyu Chen
Han Cai
E. Xie
Enze Xie
VLM
66
2
0
12 Mar 2025
Accelerating Diffusion Sampling via Exploiting Local Transition Coherence
Shangwen Zhu
Han Zhang
Zhantao Yang
Qianyu Peng
Zhao Pu
Haoran Wang
Fan Cheng
DiffM
48
0
0
12 Mar 2025
Reconstruct Anything Model: a lightweight foundation model for computational imaging
M. Terris
Samuel Hurault
Maxime Song
Julian Tachella
MedIm
DiffM
70
2
0
11 Mar 2025
Effective and Efficient Masked Image Generation Models
Zebin You
Jingyang Ou
Xiaolu Zhang
Jun Hu
Jun Zhou
Chongxuan Li
DiffM
VLM
64
1
0
10 Mar 2025
Efficient Distillation of Classifier-Free Guidance using Adapters
Cristian Perez Jensen
Seyedmorteza Sadat
53
1
0
10 Mar 2025
Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator
Kaiwen Zheng
Yongxin Chen
Huayu Chen
Guande He
Xuan Li
Jun Zhu
Qinsheng Zhang
DiffM
49
0
0
03 Mar 2025
Generative Human Geometry Distribution
Xiangjun Tang
Biao Zhang
Peter Wonka
3DH
55
0
0
03 Mar 2025
Foundation Inference Models for Stochastic Differential Equations: A Transformer-based Approach for Zero-shot Function Estimation
Patrick Seifner
K. Cvejoski
David Berghaus
C. Ojeda
Ramses J. Sanchez
DiffM
53
1
0
26 Feb 2025
Training Consistency Models with Variational Noise Coupling
Gianluigi Silvestri
L. Ambrogioni
Chieh-Hsin Lai
Yuhta Takida
Yuki Mitsufuji
90
1
0
25 Feb 2025
Hyperspherical Normalization for Scalable Deep Reinforcement Learning
Hojoon Lee
Youngdo Lee
Takuma Seno
Donghu Kim
Peter Stone
Jaegul Choo
63
1
0
24 Feb 2025
Understanding Classifier-Free Guidance: High-Dimensional Theory and Non-Linear Generalizations
Krunoslav Lehman Pavasovic
Jakob Verbeek
Giulio Biroli
Marc Mézard
64
0
0
11 Feb 2025
History-Guided Video Diffusion
Kiwhan Song
Boyuan Chen
Max Simchowitz
Yilun Du
Russ Tedrake
Vincent Sitzmann
VGen
117
7
0
10 Feb 2025
Devil is in the Details: Density Guidance for Detail-Aware Generation with Flow Models
Rafał Karczewski
Markus Heinonen
Vikas K. Garg
DiffM
47
0
0
09 Feb 2025
Beyond and Free from Diffusion: Invertible Guided Consistency Training
Chia-Hong Hsu
Shiu-hong Kao
Randall Balestriero
3DV
82
0
0
08 Feb 2025
SQ-DM: Accelerating Diffusion Models with Aggressive Quantization and Temporal Sparsity
Zichen Fan
Steve Dai
Rangharajan Venkatesan
Dennis Sylvester
Brucek Khailany
MQ
50
0
0
28 Jan 2025
CAFuser: Condition-Aware Multimodal Fusion for Robust Semantic Perception of Driving Scenes
Tim Broedermann
Christos Sakaridis
Yuqian Fu
Luc Van Gool
62
5
0
28 Jan 2025
CrossModalityDiffusion: Multi-Modal Novel View Synthesis with Unified Intermediate Representation
Alex Berian
Daniel Brignac
JhihYang Wu
Natnael Daba
Abhijit Mahalanobis
DiffM
54
1
0
20 Jan 2025
SynthLight: Portrait Relighting with Diffusion Model by Learning to Re-render Synthetic Faces
Sumit Chaturvedi
Mengwei Ren
Yannick Hold-Geoffroy
Jingyuan Liu
Julie Dorsey
Zhixin Shu
DiffM
66
0
0
17 Jan 2025
An Ordinary Differential Equation Sampler with Stochastic Start for Diffusion Bridge Models
Yuang Wang
Pengfei Jin
L. Zhang
Quanzheng Li
Zhiqiang Chen
Dufan Wu
DiffM
21
0
0
31 Dec 2024
Similarity Trajectories: Linking Sampling Process to Artifacts in Diffusion-Generated Images
Dennis Menn
Feng Liang
Hung-Yueh Chiang
Diana Marculescu
DiffM
74
0
0
22 Dec 2024
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Ho Kei Cheng
Masato Ishii
Akio Hayakawa
Takashi Shibuya
A. Schwing
Yuki Mitsufuji
VGen
126
12
0
19 Dec 2024
From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
Tianwei Yin
Qiang Zhang
Richard Zhang
William T. Freeman
F. Durand
Eli Shechtman
Xun Huang
VGen
DiffM
81
5
0
10 Dec 2024
Switti: Designing Scale-Wise Transformers for Text-to-Image Synthesis
Anton Voronov
Denis Kuznedelev
Mikhail Khoroshikh
Valentin Khrulkov
Dmitry Baranchuk
108
2
0
02 Dec 2024
Individual Content and Motion Dynamics Preserved Pruning for Video Diffusion Models
Yiming Wu
Huan Wang
Zhenghao Chen
Dong Xu
DiffM
VGen
79
1
0
27 Nov 2024