Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2305.13301
Cited By
v1
v2
v3
v4 (latest)
Training Diffusion Models with Reinforcement Learning
International Conference on Learning Representations (ICLR), 2023
22 May 2023
Kevin Black
Michael Janner
Yilun Du
Ilya Kostrikov
Sergey Levine
EGVM
Re-assign community
ArXiv (abs)
PDF
HTML
HuggingFace (4 upvotes)
Papers citing
"Training Diffusion Models with Reinforcement Learning"
50 / 270 papers shown
EvolvingGrasp: Evolutionary Grasp Generation via Efficient Preference Alignment
Yufei Zhu
Yiming Zhong
Zemin Yang
Peishan Cong
Jingyi Yu
X. Zhu
Y. Ma
408
2
0
18 Mar 2025
Revealing higher-order neural representations of uncertainty with the Noise Estimation through Reinforcement-based Diffusion (NERD) model
Hojjat Azimi Asrari
Megan A. K. Peters
DiffM
486
0
0
18 Mar 2025
PANDORA: Diffusion Policy Learning for Dexterous Robotic Piano Playing
Yanjia Huang
Renjie Li
Zhengzhong Tu
VGen
284
1
0
17 Mar 2025
Reward-Instruct: A Reward-Centric Approach to Fast Photo-Realistic Image Generation
Yihong Luo
Tianyang Hu
Weijian Luo
Kenji Kawaguchi
Jing Tang
EGVM
1.1K
0
0
17 Mar 2025
BalancedDPO: Adaptive Multi-Metric Alignment
Dipesh Tamboli
Souradip Chakraborty
Aditya Malusare
B. Banerjee
Amrit Singh Bedi
Vaneet Aggarwal
EGVM
225
2
0
16 Mar 2025
SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering
Byeongjun Park
Hyojun Go
Hyelin Nam
Byung-Hoon Kim
Hyungjin Chung
Changick Kim
VGen
LLMSV
400
5
0
15 Mar 2025
Towards Better Alignment: Training Diffusion Models with Reinforcement Learning Against Sparse Rewards
Computer Vision and Pattern Recognition (CVPR), 2025
Zijing Hu
Tai-wei Chang
Long Chen
Kun Kuang
Jiahui Li
Kaifeng Gao
Jun Xiao
X. Wang
Wenwu Zhu
EGVM
583
18
0
14 Mar 2025
Controllable Latent Diffusion for Traffic Simulation
Yizhuo Xiao
Mustafa Suphi Erden
Cheng Wang
367
1
0
14 Mar 2025
Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization
Kyle Sargent
Kyle Hsu
Justin Johnson
L. Fei-Fei
Jiajun Wu
DiffM
MU
453
23
0
14 Mar 2025
Adding Additional Control to One-Step Diffusion with Joint Distribution Matching
Yihong Luo
Tianyang Hu
Yifan Song
Jiacheng Sun
Hao Sun
Jing Tang
DiffM
336
4
0
13 Mar 2025
Learning Personalized Driving Styles via Reinforcement Learning from Human Feedback
Derun Li
Jianwei Ren
Y. Wang
Xin Wen
Pengxiang Li
...
Zhongpu Xia
Fu Liu
Xianpeng Lang
Ningyi Xu
Hang Zhao
285
12
0
13 Mar 2025
Aligning Text to Image in Diffusion Models is Easier Than You Think
J. Lee
Byunghee Cha
Jeongsol Kim
Jong Chul Ye
711
8
0
11 Mar 2025
Preference-Based Alignment of Discrete Diffusion Models
Umberto Borso
Davide Paglieri
Jude Wells
Tim Rocktaschel
268
6
0
11 Mar 2025
Learning to Match Unpaired Data with Minimum Entropy Coupling
Mustapha Bounoua
Giulio Franzese
Pietro Michiardi
369
2
0
11 Mar 2025
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model
Lixue Gong
Xiaoxia Hou
Fanshi Li
Liang Li
Xiaochen Lian
...
Tao Gui
Yuwei Zhang
Shijia Zhao
Jianchao Yang
Weilin Huang
DiffM
VLM
362
41
0
10 Mar 2025
Boosting Diffusion-Based Text Image Super-Resolution Model Towards Generalized Real-World Scenarios
Chenglu Pan
Xiaohan Li
Ganggui Ding
Yunke Zhang
Wenbo Li
Jiarong Xu
Qingbiao Wu
455
2
0
10 Mar 2025
Dynamic Search for Inference-Time Alignment in Diffusion Models
Xiner Li
Masatoshi Uehara
Xingyu Su
Gabriele Scalia
Tommaso Biancalani
Aviv Regev
Sergey Levine
Shuiwang Ji
417
21
0
03 Mar 2025
A Simple and Effective Reinforcement Learning Method for Text-to-Image Diffusion Fine-tuning
Shashank Gupta
Chaitanya Ahuja
Tsung-Yu Lin
Sreya Dutta Roy
Harrie Oosterhuis
Maarten de Rijke
Satya Narayan Shukla
588
17
0
02 Mar 2025
Posterior Inference with Diffusion Models for High-dimensional Black-box Optimization
Taeyoung Yun
Kiyoung Om
Jaewoo Lee
Sujin Yun
Jinkyoo Park
420
5
0
24 Feb 2025
Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport
Mingyang Sun
Pengxiang Ding
Weinan Zhang
Donglin Wang
OT
416
4
0
24 Feb 2025
Reward-Guided Iterative Refinement in Diffusion Models at Test-Time with Applications to Protein and DNA Design
Masatoshi Uehara
Xingyu Su
Yulai Zhao
Xiner Li
Aviv Regev
Shuiwang Ji
Sergey Levine
Tommaso Biancalani
242
12
0
20 Feb 2025
CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation
Minghao Fu
Guo-Hua Wang
Liangfu Cao
Qing-Guo Chen
Zhao Xu
Weihua Luo
Kaifu Zhang
DiffM
398
4
0
18 Feb 2025
Training-Free Guidance Beyond Differentiability: Scalable Path Steering with Tree Search in Diffusion and Flow Models
Yingqing Guo
Yukang Yang
Hui Yuan
Mengdi Wang
428
10
0
17 Feb 2025
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
International Conference on Machine Learning (ICML), 2023
Michael Psenka
Alejandro Escontrela
Pieter Abbeel
Yi-An Ma
DiffM
461
58
0
17 Feb 2025
Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation
Computer Vision and Pattern Recognition (CVPR), 2025
Taeyoung Yun
Dinghuai Zhang
Jinkyoo Park
Ling Pan
DiffM
317
12
0
17 Feb 2025
DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control
Junjie Wen
Yinlin Zhu
Jinming Li
Zhibin Tang
Yaxin Peng
Feifei Feng
VLM
478
104
0
09 Feb 2025
Dual Caption Preference Optimization for Diffusion Models
Amir Saeidi
Yiran Luo
Agneet Chatterjee
Shamanthak Hegde
Bimsara Pathiraja
Yezhou Yang
Chitta Baral
DiffM
326
1
0
09 Feb 2025
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
Hanyang Zhao
Haoxian Chen
Ji Zhang
D. Yao
Wenpin Tang
570
16
0
03 Feb 2025
Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods
Oussama Zekri
Nicolas Boullé
DiffM
606
17
0
03 Feb 2025
Refining Alignment Framework for Diffusion Models with Intermediate-Step Preference Ranking
Jie Ren
Yuhang Zhang
Dongrui Liu
Xiaopeng Zhang
Qi Tian
281
5
0
01 Feb 2025
Visual Generation Without Guidance
Huayu Chen
Kai Jiang
Kaiwen Zheng
Jianfei Chen
Hang Su
Jun Zhu
VLM
446
10
0
26 Jan 2025
Improving Video Generation with Human Feedback
Jie Liu
Gongye Liu
Jiajun Liang
Ziyang Yuan
Xiaokun Liu
...
Fei Yang
Pengfei Wan
Di Zhang
Kun Gai
Yujiu Yang
VGen
EGVM
491
103
0
23 Jan 2025
DiffDoctor: Diagnosing Image Diffusion Models Before Treating
Yiyang Wang
Xi Chen
Xiaohan Li
S. Ji
Yongxu Liu
Yujun Shen
Hengshuang Zhao
DiffM
366
1
0
21 Jan 2025
FDPP: Fine-tune Diffusion Policy with Human Preference
IEEE International Conference on Robotics and Automation (ICRA), 2025
Yuxin Chen
Devesh K. Jha
Masayoshi Tomizuka
Diego Romeres
329
8
0
14 Jan 2025
Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints
AAAI Conference on Artificial Intelligence (AAAI), 2025
Jonathan Nöther
Adish Singla
Goran Radanović
AAML
382
0
0
14 Jan 2025
A General Framework for Inference-time Scaling and Steering of Diffusion Models
R. Singhal
Zachary Horvitz
Ryan Teehan
Mengye Ren
Zhou Yu
Kathleen McKeown
Rajesh Ranganath
DiffM
574
101
0
12 Jan 2025
AdaDiff: Adaptive Step Selection for Fast Diffusion Models
Hui Zhang
Zuxuan Wu
Zhen Xing
Jie Shao
Yu-Gang Jiang
332
19
0
31 Dec 2024
Pareto-Optimal Energy Alignment for Designing Nature-Like Antibodies
Yibo Wen
Chenwei Xu
Jerry Yao-Chieh Hu
Han Liu
Han Liu
DiffM
266
6
0
30 Dec 2024
Efficient Diversity-Preserving Diffusion Alignment via Gradient-Informed GFlowNets
International Conference on Learning Representations (ICLR), 2024
Zhen Liu
Tim Z. Xiao
Weiyang Liu
Yoshua Bengio
Dinghuai Zhang
699
19
0
10 Dec 2024
DyMO: Training-Free Diffusion Model Alignment with Dynamic Multi-Objective Scheduling
Computer Vision and Pattern Recognition (CVPR), 2024
Xin Xie
Dong Gong
582
13
0
01 Dec 2024
Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation
IEEE Robotics and Automation Letters (RA-L), 2024
Huy Le
Miroslav Gabriel
Tai Hoang
Gerhard Neumann
Ngo Anh Vien
456
2
0
22 Nov 2024
Reward Fine-Tuning Two-Step Diffusion Models via Learning Differentiable Latent-Space Surrogate Reward
Computer Vision and Pattern Recognition (CVPR), 2024
Zhiwei Jia
Yuesong Nan
Huixi Zhao
Gengdai Liu
EGVM
540
8
0
22 Nov 2024
FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations
Computer Vision and Pattern Recognition (CVPR), 2024
Hmrishav Bandyopadhyay
Yi-Zhe Song
DiffM
VGen
233
6
0
16 Nov 2024
David and Goliath: Small One-step Model Beats Large Diffusion with Score Post-training
Weijian Luo
C. Zhang
Debing Zhang
Zhengyang Geng
379
4
0
28 Oct 2024
Towards Visual Text Design Transfer Across Languages
Neural Information Processing Systems (NeurIPS), 2024
Yejin Choi
Jiwan Chung
Sumin Shim
Giyeong Oh
Youngjae Yu
VLM
DiffM
153
1
0
24 Oct 2024
Diff-Instruct++: Training One-step Text-to-image Generator Model to Align with Human Preferences
Weijian Luo
EGVM
351
18
0
24 Oct 2024
Training Free Guided Flow Matching with Optimal Control
International Conference on Learning Representations (ICLR), 2024
Luran Wang
Chaoran Cheng
Yizhen Liao
Yanru Qu
Ge Liu
426
10
0
23 Oct 2024
Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design
International Conference on Learning Representations (ICLR), 2024
Chenyu Wang
Masatoshi Uehara
Yichun He
Amy Wang
Tommaso Biancalani
Avantika Lal
Tommi Jaakkola
Sergey Levine
Hanchen Wang
Aviv Regev
286
40
0
17 Oct 2024
Preference Optimization with Multi-Sample Comparisons
Chaoqi Wang
Zhuokai Zhao
Chen Zhu
Karthik Abinav Sankararaman
Michal Valko
...
Zhaorun Chen
Madian Khabsa
Yuxin Chen
Hao Ma
Sinong Wang
337
14
0
16 Oct 2024
Improving Long-Text Alignment for Text-to-Image Diffusion Models
International Conference on Learning Representations (ICLR), 2024
Luping Liu
Chao Du
Tianyu Pang
Zehan Wang
Chongxuan Li
Dong Xu
VLM
308
12
0
15 Oct 2024
Previous
1
2
3
4
5
6
Next