ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.13301
  4. Cited By
Training Diffusion Models with Reinforcement Learning
v1v2v3v4 (latest)

Training Diffusion Models with Reinforcement Learning

International Conference on Learning Representations (ICLR), 2023
22 May 2023
Kevin Black
Michael Janner
Yilun Du
Ilya Kostrikov
Sergey Levine
    EGVM
ArXiv (abs)PDFHTMLHuggingFace (4 upvotes)

Papers citing "Training Diffusion Models with Reinforcement Learning"

50 / 270 papers shown
Steering Masked Discrete Diffusion Models via Discrete Denoising
  Posterior Prediction
Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior PredictionInternational Conference on Learning Representations (ICLR), 2024
Jarrid Rector-Brooks
Mohsin Hasan
Zhangzhi Peng
Zachary Quinn
Chenghao Liu
...
Michael Bronstein
Yoshua Bengio
Pranam Chatterjee
Alexander Tong
Avishek Joey Bose
DiffM
273
22
0
10 Oct 2024
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image GenerationInternational Conference on Learning Representations (ICLR), 2024
Xinchen Zhang
Ling Yang
Ge Li
Yaqi Cai
Jiake Xie
Yong Tang
Yujiu Yang
Mengdi Wang
Bin Cui
EGVMCoGe
332
19
0
09 Oct 2024
Gen-Drive: Enhancing Diffusion Generative Driving Policies with Reward
  Modeling and Reinforcement Learning Fine-tuning
Gen-Drive: Enhancing Diffusion Generative Driving Policies with Reward Modeling and Reinforcement Learning Fine-tuningIEEE International Conference on Robotics and Automation (ICRA), 2024
Zhiyu Huang
Xinshuo Weng
Maximilian Igl
Yuxiao Chen
Yulong Cao
Boris Ivanovic
Marco Pavone
Chen Lv
170
30
0
08 Oct 2024
Training-free Diffusion Model Alignment with Sampling Demons
Training-free Diffusion Model Alignment with Sampling DemonsInternational Conference on Learning Representations (ICLR), 2024
Po-Hung Yeh
Kuang-Huei Lee
Jun-Cheng Chen
284
16
0
08 Oct 2024
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing
June Suk Choi
Kyungmin Lee
Jongheon Jeong
Saining Xie
Jinwoo Shin
Kimin Lee
DiffMAAML
255
10
0
08 Oct 2024
Bridging SFT and DPO for Diffusion Model Alignment with Self-Sampling Preference Optimization
Bridging SFT and DPO for Diffusion Model Alignment with Self-Sampling Preference Optimization
Daoan Zhang
Guangchen Lan
Dong-Jun Han
Wenlin Yao
Xiaoman Pan
...
Mingxiao Li
Pengcheng Chen
Yu Dong
Christopher G. Brinton
Jiebo Luo
EGVM
348
7
0
07 Oct 2024
HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model Finetuning
HERO: Human-Feedback Efficient Reinforcement Learning for Online Diffusion Model FinetuningInternational Conference on Learning Representations (ICLR), 2024
Ayano Hiranaka
Shang-Fu Chen
Chieh-Hsin Lai
Dongjun Kim
Naoki Murata
Takashi Shibuya
Wei-Hsiang Liao
Shao-Hua Sun
Yuki Mitsufuji
400
2
0
07 Oct 2024
Text2Chart31: Instruction Tuning for Chart Generation with Automatic Feedback
Text2Chart31: Instruction Tuning for Chart Generation with Automatic FeedbackConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
Fatemeh Pesaran Zadeh
Juyeon Kim
Jin-Hwa Kim
Gunhee Kim
ALM
339
13
0
05 Oct 2024
Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample Optimization
Tuning Timestep-Distilled Diffusion Model Using Pairwise Sample OptimizationInternational Conference on Learning Representations (ICLR), 2024
Zichen Miao
Zhengyuan Yang
Kevin Lin
Ze Wang
Zicheng Liu
Lijuan Wang
Qiang Qiu
400
14
0
04 Oct 2024
ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation
ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation
Rinon Gal
Adi Haviv
Yuval Alaluf
Amit H. Bermano
Daniel Cohen-Or
Gal Chechik
DiffM
188
8
0
02 Oct 2024
Task-Agnostic Pre-training and Task-Guided Fine-tuning for Versatile Diffusion Planner
Task-Agnostic Pre-training and Task-Guided Fine-tuning for Versatile Diffusion Planner
Chenyou Fan
Chenjia Bai
Zhao Shan
Haoran He
Yang Zhang
Zhen Wang
406
4
0
30 Sep 2024
TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation
TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic ManipulationIEEE Robotics and Automation Letters (RA-L), 2024
Junjie Wen
Yinlin Zhu
Jinming Li
Minjie Zhu
Kun Wu
...
Ran Cheng
Yaxin Peng
Chaomin Shen
Feifei Feng
Jian Tang
LM&Ro
743
217
0
19 Sep 2024
Alignment of Diffusion Models: Fundamentals, Challenges, and Future
Alignment of Diffusion Models: Fundamentals, Challenges, and Future
Buhua Liu
Shitong Shao
Bao Li
Lichen Bai
Zhiqiang Xu
Haoyi Xiong
James Kwok
Sumi Helal
Bo Han
463
22
0
11 Sep 2024
Elucidating Optimal Reward-Diversity Tradeoffs in Text-to-Image
  Diffusion Models
Elucidating Optimal Reward-Diversity Tradeoffs in Text-to-Image Diffusion ModelsIEEE Workshop/Winter Conference on Applications of Computer Vision (WACV), 2024
Rohit Jena
Ali Taghibakhshi
Sahil Jain
Gerald Shen
Nima Tajbakhsh
Arash Vahdat
415
8
0
09 Sep 2024
Reward-Directed Score-Based Diffusion Models via q-Learning
Reward-Directed Score-Based Diffusion Models via q-Learning
Ningyuan Chen
Jiale Zha
X. Zhou
DiffM
257
8
0
07 Sep 2024
RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model
RLCP: A Reinforcement Learning-based Copyright Protection Method for Text-to-Image Diffusion Model
Zhuan Shi
Jing Yan
Xiaoli Tang
Lingjuan Lyu
Boi Faltings
433
1
0
29 Aug 2024
Constrained Diffusion Models via Dual Training
Constrained Diffusion Models via Dual TrainingNeural Information Processing Systems (NeurIPS), 2024
Shervin Khalafi
Dongsheng Ding
Alejandro Ribeiro
311
13
0
27 Aug 2024
Towards Reliable Advertising Image Generation Using Human Feedback
Towards Reliable Advertising Image Generation Using Human FeedbackEuropean Conference on Computer Vision (ECCV), 2024
Thorben Werner
Wei Feng
Haohan Wang
Yaoyu Li
Jingsen Wang
...
Ilia Koloiarov
Junsheng Jin
Lars Schmidt-Thieme
Zhangang Lin
Jingping Shao
341
8
0
01 Aug 2024
Aligning Diffusion Behaviors with Q-functions for Efficient Continuous
  Control
Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control
Huayu Chen
Kaiwen Zheng
Hang Su
Jun Zhu
368
8
0
12 Jul 2024
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for
  Text-to-Image Generation?
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
Zhaorun Chen
Yichao Du
Zichen Wen
Yiyang Zhou
Chenhang Cui
...
Jiawei Zhou
Zhuokai Zhao
Rafael Rafailov
Chelsea Finn
Huaxiu Yao
EGVMMLLM
327
56
0
05 Jul 2024
Diminishing Stereotype Bias in Image Generation Model using
  Reinforcemenlent Learning Feedback
Diminishing Stereotype Bias in Image Generation Model using Reinforcemenlent Learning Feedback
Xin Chen
Virgile Foussereau
EGVM
148
1
0
27 Jun 2024
Aligning Diffusion Models with Noise-Conditioned Perception
Aligning Diffusion Models with Noise-Conditioned Perception
Alexander Gambashidze
Anton Kulikov
Yuriy Sosnin
Ilya Makarov
325
10
0
25 Jun 2024
Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback
  for Text-to-Image Generation
Beyond Thumbs Up/Down: Untangling Challenges of Fine-Grained Feedback for Text-to-Image Generation
Katherine M. Collins
Najoung Kim
Yonatan Bitton
Verena Rieser
Shayegan Omidshafiei
...
Gang Li
Adrian Weller
Junfeng He
Deepak Ramachandran
Krishnamurthy Dvijotham
EGVM
188
3
0
24 Jun 2024
Adding Conditional Control to Diffusion Models with Reinforcement Learning
Adding Conditional Control to Diffusion Models with Reinforcement Learning
Yulai Zhao
Masatoshi Uehara
Gabriele Scalia
Tommaso Biancalani
Sergey Levine
Ehsan Hajiramezanali
Ehsan Hajiramezanali
AI4CE
500
13
0
17 Jun 2024
InstructRL4Pix: Training Diffusion for Image Editing by Reinforcement
  Learning
InstructRL4Pix: Training Diffusion for Image Editing by Reinforcement Learning
Tiancheng Li
Yu Lei
Huajun Chen
Nan Zhuang
EGVM
285
3
0
14 Jun 2024
Margin-aware Preference Optimization for Aligning Diffusion Models without Reference
Margin-aware Preference Optimization for Aligning Diffusion Models without Reference
Jiwoo Hong
Sayak Paul
Noah Lee
Kashif Rasul
James Thorne
Jongheon Jeong
310
31
0
10 Jun 2024
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
Hao Wen
Zehuan Huang
Yaohui Wang
Xinyuan Chen
Yu Qiao
377
20
0
05 Jun 2024
Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models
Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models
Xinxi Zhang
Song Wen
Ligong Han
Felix Juefei Xu
Akash Srivastava
Junzhou Huang
Hao Wang
Molei Tao
Dimitris N. Metaxas
DiffM
192
9
0
31 May 2024
Amortizing intractable inference in diffusion models for vision, language, and control
Amortizing intractable inference in diffusion models for vision, language, and control
S. Venkatraman
Moksh Jain
Luca Scimeca
Minsu Kim
Marcin Sendera
...
Alexandre Adam
Jarrid Rector-Brooks
Yoshua Bengio
Glen Berseth
Nikolay Malkin
406
52
0
31 May 2024
Curriculum Direct Preference Optimization for Diffusion and Consistency Models
Curriculum Direct Preference Optimization for Diffusion and Consistency Models
Florinel-Alin Croitoru
Vlad Hondru
Radu Tudor Ionescu
Andrii Zadaianchuk
Mubarak Shah
EGVM
626
20
0
22 May 2024
Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models
Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models
Xiaoshi Wu
Yiming Hao
Manyuan Zhang
Keqiang Sun
Zhaoyang Huang
Guanglu Song
Yu Liu
Jiaming Song
EGVM
246
42
0
01 May 2024
Large Multi-modality Model Assisted AI-Generated Image Quality
  Assessment
Large Multi-modality Model Assisted AI-Generated Image Quality Assessment
Puyi Wang
Wei Sun
Zicheng Zhang
Jun Jia
Yanwei Jiang
Zhichao Zhang
Xiongkuo Min
Guangtao Zhai
EGVM
157
26
0
27 Apr 2024
YaART: Yet Another ART Rendering Technology
YaART: Yet Another ART Rendering Technology
Sergey Kastryulin
Artem Konev
Alexander Shishenya
Eugene Lyapustin
Artem Khurshudov
...
Dmitrii Kornilov
Mikhail Romanov
Artem Babenko
Sergei Ovcharenko
Valentin Khrulkov
EGVM
214
3
0
08 Apr 2024
Aligning Diffusion Models by Optimizing Human Utility
Aligning Diffusion Models by Optimizing Human Utility
Shufan Li
Konstantinos Kallidromitis
Akash Gokul
Yusuke Kato
Kazuki Kozuka
305
67
0
06 Apr 2024
Idea-2-3D: Collaborative LMM Agents Enable 3D Model Generation from
  Interleaved Multimodal Inputs
Idea-2-3D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
Junhao Chen
Xiang Li
Xiaojun Ye
Chao Li
Zhaoxin Fan
Hao Zhao
VGen3DV
400
6
0
05 Apr 2024
Pixel-wise RL on Diffusion Models: Reinforcement Learning from Rich
  Feedback
Pixel-wise RL on Diffusion Models: Reinforcement Learning from Rich Feedback
Mo Kordzanganeh
Danial Keshvary
Nariman Arian
EGVM
121
1
0
05 Apr 2024
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept
  Matching
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept MatchingNeural Information Processing Systems (NeurIPS), 2024
Dongzhi Jiang
Guanglu Song
Xiaoshi Wu
Renrui Zhang
Dazhong Shen
Zhuofan Zong
Yu Liu
Jiaming Song
VLM
459
53
0
04 Apr 2024
Confidence-aware Reward Optimization for Fine-tuning Text-to-Image
  Models
Confidence-aware Reward Optimization for Fine-tuning Text-to-Image ModelsInternational Conference on Learning Representations (ICLR), 2024
Kyuyoung Kim
Jongheon Jeong
Minyong An
Mohammad Ghavamzadeh
Krishnamurthy Dvijotham
Jinwoo Shin
Kimin Lee
EGVM
178
6
0
02 Apr 2024
TextCraftor: Your Text Encoder Can be Image Quality Controller
TextCraftor: Your Text Encoder Can be Image Quality Controller
Yanyu Li
Xian Liu
Vidit Goel
Ju Hu
Yerlan Idelbayev
Dhritiman Sagar
Yanzhi Wang
Sergey Tulyakov
Jian Ren
301
27
0
27 Mar 2024
Antigen-Specific Antibody Design via Direct Energy-based Preference
  Optimization
Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization
Xiangxin Zhou
Dongyu Xue
Ruizhe Chen
Zaixiang Zheng
Liang Wang
Quanquan Gu
DiffM
387
33
0
25 Mar 2024
MyVLM: Personalizing VLMs for User-Specific Queries
MyVLM: Personalizing VLMs for User-Specific Queries
Yuval Alaluf
Elad Richardson
Sergey Tulyakov
Kfir Aberman
Daniel Cohen-Or
MLLMVLM
309
42
0
21 Mar 2024
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with
  Auto-Generated Data
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated DataNeural Information Processing Systems (NeurIPS), 2024
Jialu Li
Jaemin Cho
Yi-Lin Sung
Jaehong Yoon
Mohit Bansal
MoMeDiffM
254
15
0
11 Mar 2024
Fine-tuning of diffusion models via stochastic control: entropy regularization and beyond
Fine-tuning of diffusion models via stochastic control: entropy regularization and beyond
Wenpin Tang
Fuzhong Zhou
402
28
0
10 Mar 2024
On the Challenges and Opportunities in Generative AI
On the Challenges and Opportunities in Generative AI
Laura Manduchi
Kushagra Pandey
Kushagra Pandey
Robert Bamler
Sina Daubener
...
Yixin Wang
F. Wenzel
Frank Wood
Stephan Mandt
Vincent Fortuin
761
40
0
28 Feb 2024
Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized
  Control
Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control
Masatoshi Uehara
Yulai Zhao
Kevin Black
Ehsan Hajiramezanali
Gabriele Scalia
N. Diamant
Alex Tseng
Tommaso Biancalani
Sergey Levine
273
83
0
23 Feb 2024
Score-based Diffusion Models via Stochastic Differential Equations -- a Technical Tutorial
Score-based Diffusion Models via Stochastic Differential Equations -- a Technical TutorialStatistics Survey (Stat. Surv.), 2024
Wenpin Tang
Hanyang Zhao
DiffM
396
40
0
12 Feb 2024
Implicit Diffusion: Efficient Optimization through Stochastic Sampling
Implicit Diffusion: Efficient Optimization through Stochastic Sampling
Pierre Marion
Anna Korba
Peter Bartlett
Mathieu Blondel
Valentin De Bortoli
Arnaud Doucet
Felipe Llinares-López
Courtney Paquette
Quentin Berthet
435
19
0
08 Feb 2024
DITTO: Diffusion Inference-Time T-Optimization for Music Generation
DITTO: Diffusion Inference-Time T-Optimization for Music GenerationInternational Conference on Machine Learning (ICML), 2024
Cheng-i Wang
Julian McAuley
Taylor Berg-Kirkpatrick
Nicholas J. Bryan
DiffM
283
71
0
22 Jan 2024
DiffusionAgent: Navigating Expert Models for Agentic Image Generation
DiffusionAgent: Navigating Expert Models for Agentic Image Generation
Jie Qin
Jie Wu
Weifeng Chen
Yuxi Ren
DiffM
179
53
0
18 Jan 2024
A New Creative Generation Pipeline for Click-Through Rate with Stable
  Diffusion Model
A New Creative Generation Pipeline for Click-Through Rate with Stable Diffusion ModelThe Web Conference (WWW), 2024
Hao Yang
Jianxin Yuan
Shuai Yang
Linhe Xu
Shuo Yuan
Yifan Zeng
219
22
0
17 Jan 2024
Previous
123456
Next