ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2402.15194
  4. Cited By
Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized
  Control
v1v2 (latest)

Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control

23 February 2024
Masatoshi Uehara
Yulai Zhao
Kevin Black
Ehsan Hajiramezanali
Gabriele Scalia
N. Diamant
Alex Tseng
Tommaso Biancalani
Sergey Levine
ArXiv (abs)PDFHTML

Papers citing "Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control"

50 / 62 papers shown
Title
Diffusion Fine-Tuning via Reparameterized Policy Gradient of the Soft Q-Function
Diffusion Fine-Tuning via Reparameterized Policy Gradient of the Soft Q-Function
Hyeongyu Kang
Jaewoo Lee
Woocheol Shin
Kiyoung Om
Jinkyoo Park
52
0
0
04 Dec 2025
Value Gradient Guidance for Flow Matching Alignment
Value Gradient Guidance for Flow Matching Alignment
Zhen Liu
Tim Z. Xiao
Carles Domingo-Enrich
Weiyang Liu
Dinghuai Zhang
12
0
0
04 Dec 2025
Iterative Tilting for Diffusion Fine-Tuning
Iterative Tilting for Diffusion Fine-Tuning
Jean Pachebat
Giovanni Conforti
Alain Durmus
Yazid Janati
DiffM
70
0
0
02 Dec 2025
Flow Density Control: Generative Optimization Beyond Entropy-Regularized Fine-Tuning
Flow Density Control: Generative Optimization Beyond Entropy-Regularized Fine-Tuning
Riccardo De Santi
Marin Vlastelica
Ya-Ping Hsieh
Zebang Shen
Niao He
Andreas Krause
AI4CE
55
0
0
27 Nov 2025
Test-Time Alignment of Text-to-Image Diffusion Models via Null-Text Embedding Optimisation
Test-Time Alignment of Text-to-Image Diffusion Models via Null-Text Embedding Optimisation
Taehoon Kim
Henry Gouk
Timothy M. Hospedales
193
0
0
25 Nov 2025
ProxT2I: Efficient Reward-Guided Text-to-Image Generation via Proximal Diffusion
ProxT2I: Efficient Reward-Guided Text-to-Image Generation via Proximal Diffusion
Zhenghan Fang
Jian Zheng
Qiaozi Gao
Xiaofeng Gao
Jeremias Sulam
204
0
0
24 Nov 2025
Coffee: Controllable Diffusion Fine-tuning
Coffee: Controllable Diffusion Fine-tuning
Ziyao Zeng
Jingcheng Ni
Ruyi Liu
Alex Wong
DiffM
153
0
0
18 Nov 2025
Embodiment Transfer Learning for Vision-Language-Action Models
Embodiment Transfer Learning for Vision-Language-Action Models
Chengmeng Li
Yaxin Peng
116
0
0
03 Nov 2025
MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency
MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency
Nicolas Dufour
Lucas Degeorge
Arijit Ghosh
Vicky Kalogeiton
David Picard
EGVM
352
1
0
29 Oct 2025
Schrödinger bridge for generative AI: Soft-constrained formulation and convergence analysis
Schrödinger bridge for generative AI: Soft-constrained formulation and convergence analysis
Jin Ma
Ying Tan
Renyuan Xu
157
0
0
13 Oct 2025
Understanding Sampler Stochasticity in Training Diffusion Models for RLHF
Understanding Sampler Stochasticity in Training Diffusion Models for RLHF
Jiayuan Sheng
Hanyang Zhao
Haoxian Chen
David Yao
Wenpin Tang
135
0
0
12 Oct 2025
Calibrating Generative Models
Calibrating Generative Models
Henry D. Smith
Nathaniel L. Diamant
Brian L. Trippe
132
0
0
11 Oct 2025
Fine-Tuning Diffusion Models via Intermediate Distribution Shaping
Fine-Tuning Diffusion Models via Intermediate Distribution Shaping
Gautham Govind Anil
Shaan Ul Haque
Nithish Kannen
Dheeraj M. Nagaraj
Sanjay Shakkottai
Karthikeyan Shanmugam
116
0
0
03 Oct 2025
Diffusion Alignment as Variational Expectation-Maximization
Diffusion Alignment as Variational Expectation-Maximization
Jaewoo Lee
Minsu Kim
S. Choi
Inhyuck Song
Sujin Yun
Hyeongyu Kang
Woocheol Shin
Taeyoung Yun
Kiyoung Om
Jinkyoo Park
103
0
0
01 Oct 2025
TR2-D2: Tree Search Guided Trajectory-Aware Fine-Tuning for Discrete Diffusion
TR2-D2: Tree Search Guided Trajectory-Aware Fine-Tuning for Discrete Diffusion
Sophia Tang
Yuchen Zhu
Molei Tao
Pranam Chatterjee
134
5
0
29 Sep 2025
DriftLite: Lightweight Drift Control for Inference-Time Scaling of Diffusion Models
DriftLite: Lightweight Drift Control for Inference-Time Scaling of Diffusion Models
Yinuo Ren
Wenhao Gao
Lexing Ying
Grant M. Rotskoff
Jiequn Han
172
3
0
25 Sep 2025
Composition and Alignment of Diffusion Models using Constrained Learning
Composition and Alignment of Diffusion Models using Constrained Learning
Shervin Khalafi
Ignacio Hounie
Dongsheng Ding
Alejandro Ribeiro
144
2
0
26 Aug 2025
Source-Guided Flow Matching
Source-Guided Flow Matching
Zifan Wang
Alice Harting
Matthieu Barreau
Michael M. Zavlanos
Karl H. Johansson
176
1
0
20 Aug 2025
Trust Region Constrained Measure Transport in Path Space for Stochastic Optimal Control and Inference
Trust Region Constrained Measure Transport in Path Space for Stochastic Optimal Control and Inference
Denis Blessing
Julius Berner
Lorenz Richter
Carles Domingo-Enrich
Yuanqi Du
Arash Vahdat
Gerhard Neumann
116
6
0
17 Aug 2025
Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models
Noise Hypernetworks: Amortizing Test-Time Compute in Diffusion Models
L. Eyring
Shyamgopal Karthik
Alexey Dosovitskiy
Nataniel Ruiz
Zeynep Akata
DiffM
175
8
0
13 Aug 2025
Inference-Time Scaling of Diffusion Language Models with Particle Gibbs Sampling
Inference-Time Scaling of Diffusion Language Models with Particle Gibbs Sampling
Meihua Dang
Jiaqi Han
Minkai Xu
Kai Xu
Akash Srivastava
Stefano Ermon
DiffM
94
7
0
11 Jul 2025
Training-Free Stein Diffusion Guidance: Posterior Correction for Sampling Beyond High-Density Regions
Training-Free Stein Diffusion Guidance: Posterior Correction for Sampling Beyond High-Density Regions
Van Khoa Nguyen
Lionel Blondé
Alexandros Kalousis
144
0
0
07 Jul 2025
Provable Maximum Entropy Manifold Exploration via Diffusion Models
Provable Maximum Entropy Manifold Exploration via Diffusion Models
Riccardo De Santi
Marin Vlastelica
Ya-Ping Hsieh
Zebang Shen
Niao He
Andreas Krause
DiffM
194
5
0
18 Jun 2025
Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards
Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards
Qingming Liu
Zhen Liu
Dinghuai Zhang
Kui Jia
245
2
0
18 Jun 2025
When Models Know More Than They Can Explain: Quantifying Knowledge Transfer in Human-AI Collaboration
When Models Know More Than They Can Explain: Quantifying Knowledge Transfer in Human-AI Collaboration
Quan Shi
Carlos E. Jimenez
Shunyu Yao
Nick Haber
Diyi Yang
Karthik Narasimhan
318
1
0
05 Jun 2025
Psi-Sampler: Initial Particle Sampling for SMC-Based Inference-Time Reward Alignment in Score Models
Psi-Sampler: Initial Particle Sampling for SMC-Based Inference-Time Reward Alignment in Score Models
Taehoon Yoon
Yunhong Min
Kyeongmin Yeo
Minhyuk Sung
345
0
0
02 Jun 2025
ChatVLA-2: Vision-Language-Action Model with Open-World Embodied Reasoning from Pretrained Knowledge
ChatVLA-2: Vision-Language-Action Model with Open-World Embodied Reasoning from Pretrained Knowledge
Zhongyi Zhou
Yichen Zhu
Junjie Wen
Chaomin Shen
Yi Xu
LM&RoLRMVLM
353
0
0
28 May 2025
Efficient Controllable Diffusion via Optimal Classifier Guidance
Efficient Controllable Diffusion via Optimal Classifier Guidance
Owen Oertell
Shikun Sun
Yiding Chen
Jin Peng Zhou
Zhiyong Wang
Wen Sun
234
0
0
27 May 2025
Diffusion Blend: Inference-Time Multi-Preference Alignment for Diffusion Models
Diffusion Blend: Inference-Time Multi-Preference Alignment for Diffusion Models
Min Cheng
Fatemeh Doudi
D. Kalathil
Mohammad Ghavamzadeh
P. R. Kumar
266
1
0
24 May 2025
Scaling Image and Video Generation via Test-Time Evolutionary Search
Haoran He
Jiajun Liang
X. Wang
Pengfei Wan
Di Zhang
Kun Gai
Ling Pan
DiffM
368
8
0
23 May 2025
ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation
ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation
Yunhong Min
Daehyeon Choi
Kyeongmin Yeo
Jihyun Lee
Minhyuk Sung
457
1
0
28 Mar 2025
Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing
Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing
Jaihoon Kim
Taehoon Yoon
Jisung Hwang
Minhyuk Sung
DiffM
450
19
0
25 Mar 2025
Learning to Match Unpaired Data with Minimum Entropy Coupling
Mustapha Bounoua
Giulio Franzese
Pietro Michiardi
351
2
0
11 Mar 2025
Posterior Inference with Diffusion Models for High-dimensional Black-box Optimization
Posterior Inference with Diffusion Models for High-dimensional Black-box Optimization
Taeyoung Yun
Kiyoung Om
Jaewoo Lee
Sujin Yun
Jinkyoo Park
403
5
0
24 Feb 2025
Reward-Guided Iterative Refinement in Diffusion Models at Test-Time with Applications to Protein and DNA Design
Reward-Guided Iterative Refinement in Diffusion Models at Test-Time with Applications to Protein and DNA Design
Masatoshi Uehara
Xingyu Su
Yulai Zhao
Xiner Li
Aviv Regev
Shuiwang Ji
Sergey Levine
Tommaso Biancalani
241
11
0
20 Feb 2025
Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models
Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models
Yingqing Guo
Yukang Yang
Hui Yuan
Mengdi Wang
408
10
0
17 Feb 2025
DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control
DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control
Junjie Wen
Yinlin Zhu
Jinming Li
Zhibin Tang
Yaxin Peng
Feifei Feng
VLM
456
101
0
09 Feb 2025
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
Hanyang Zhao
Haoxian Chen
Ji Zhang
D. Yao
Wenpin Tang
531
16
0
03 Feb 2025
Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets
Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNetsInternational Conference on Learning Representations (ICLR), 2024
Zhen Liu
Tim Z. Xiao
Weiyang Liu
Yoshua Bengio
Dinghuai Zhang
687
19
0
10 Dec 2024
Diffusion-VLA: Generalizable and Interpretable Robot Foundation Model via Self-Generated Reasoning
Diffusion-VLA: Generalizable and Interpretable Robot Foundation Model via Self-Generated Reasoning
Junjie Wen
Minjie Zhu
Yinlin Zhu
Zhibin Tang
Jinming Li
...
Chengmeng Li
Xiaoyu Liu
Chaomin Shen
Yaxin Peng
Feifei Feng
389
13
0
04 Dec 2024
Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation
Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile ManipulationIEEE Robotics and Automation Letters (RA-L), 2024
Huy Le
Miroslav Gabriel
Tai Hoang
Gerhard Neumann
Ngo Anh Vien
434
1
0
22 Nov 2024
Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design
Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein DesignInternational Conference on Learning Representations (ICLR), 2024
Chenyu Wang
Masatoshi Uehara
Yichun He
Amy Wang
Tommaso Biancalani
Avantika Lal
Tommi Jaakkola
Sergey Levine
Hanchen Wang
Aviv Regev
281
40
0
17 Oct 2024
Steering Masked Discrete Diffusion Models via Discrete Denoising
  Posterior Prediction
Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior PredictionInternational Conference on Learning Representations (ICLR), 2024
Jarrid Rector-Brooks
Mohsin Hasan
Zhangzhi Peng
Zachary Quinn
Chenghao Liu
...
Michael Bronstein
Yoshua Bengio
Pranam Chatterjee
Alexander Tong
Avishek Joey Bose
DiffM
251
22
0
10 Oct 2024
A Taxonomy of Loss Functions for Stochastic Optimal Control
A Taxonomy of Loss Functions for Stochastic Optimal Control
Carles Domingo-Enrich
236
8
0
01 Oct 2024
Scores as Actions: a framework of fine-tuning diffusion models by
  continuous-time reinforcement learning
Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning
Hanyang Zhao
Haoxian Chen
Ji Zhang
David D. Yao
Wenpin Tang
308
11
0
12 Sep 2024
Alignment of Diffusion Models: Fundamentals, Challenges, and Future
Alignment of Diffusion Models: Fundamentals, Challenges, and Future
Buhua Liu
Shitong Shao
Bao Li
Lichen Bai
Zhiqiang Xu
Haoyi Xiong
James Kwok
Sumi Helal
Bo Han
435
22
0
11 Sep 2024
Reward-Directed Score-Based Diffusion Models via q-Learning
Reward-Directed Score-Based Diffusion Models via q-Learning
Ningyuan Chen
Jiale Zha
X. Zhou
DiffM
241
8
0
07 Sep 2024
Constrained Diffusion Models via Dual Training
Constrained Diffusion Models via Dual TrainingNeural Information Processing Systems (NeurIPS), 2024
Shervin Khalafi
Dongsheng Ding
Alejandro Ribeiro
296
13
0
27 Aug 2024
Derivative-Free Guidance in Continuous and Discrete Diffusion Models
  with Soft Value-Based Decoding
Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding
Xiner Li
Yulai Zhao
Chenyu Wang
Gabriele Scalia
Gökçen Eraslan
Surag Nair
Tommaso Biancalani
Aviv Regev
Sergey Levine
Masatoshi Uehara
352
81
0
15 Aug 2024
Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion
  Models: A Tutorial and Review
Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review
Masatoshi Uehara
Yulai Zhao
Tommaso Biancalani
Sergey Levine
286
54
0
18 Jul 2024
12
Next