Training Diffusion Models with Reinforcement Learning


22 May 2023
Kevin Black
Michael Janner
Yilun Du
Ilya Kostrikov
Sergey Levine
    EGVM

Papers citing "Training Diffusion Models with Reinforcement Learning"

50 / 244 papers shown
Curriculum Direct Preference Optimization for Diffusion and Consistency Models
Florinel-Alin Croitoru
Vlad Hondru
Radu Tudor Ionescu
N. Sebe
Mubarak Shah
EGVM
81
5
0
22 May 2024
MoDiPO: text-to-motion alignment via AI-feedback-driven Direct Preference Optimization
Massimiliano Pappa
Luca Collorone
Giovanni Ficarra
Indro Spinelli
Fabio Galasso
43
1
0
06 May 2024
Deep Reward Supervisions for Tuning Text-to-Image Diffusion Models
Xiaoshi Wu
Yiming Hao
Manyuan Zhang
Keqiang Sun
Zhaoyang Huang
Guanglu Song
Yu Liu
Hongsheng Li
EGVM
68
16
0
01 May 2024
Large Multi-modality Model Assisted AI-Generated Image Quality Assessment
Puyi Wang
Wei Sun
Zicheng Zhang
Jun Jia
Yanwei Jiang
Zhichao Zhang
Xiongkuo Min
Guangtao Zhai
EGVM
34
11
0
27 Apr 2024
REBEL: Reinforcement Learning via Regressing Relative Rewards
Zhaolin Gao
Jonathan D. Chang
Wenhao Zhan
Owen Oertell
Gokul Swamy
Kianté Brantley
Thorsten Joachims
J. Andrew Bagnell
Jason D. Lee
Wen Sun
OffRL
30
31
0
25 Apr 2024
ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Weifeng Chen
Jiacheng Zhang
Jie Wu
Hefeng Wu
Xuefeng Xiao
Liang Lin
28
12
0
23 Apr 2024
Multimodal Large Language Model is a Human-Aligned Annotator for Text-to-Image Generation
Xun Wu
Shaohan Huang
Furu Wei
42
8
0
23 Apr 2024
Gradient Guidance for Diffusion Models: An Optimization Perspective
Yingqing Guo
Hui Yuan
Yukang Yang
Minshuo Chen
Mengdi Wang
18
20
0
23 Apr 2024
RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs
Shreyas Chaudhari
Pranjal Aggarwal
Vishvak Murahari
Tanmay Rajpurohit
A. Kalyan
Karthik Narasimhan
A. Deshpande
Bruno Castro da Silva
21
33
0
12 Apr 2024
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Ming Li
Taojiannan Yang
Huafeng Kuang
Jie Wu
Zhaoning Wang
Xuefeng Xiao
C. L. P. Chen
33
59
0
11 Apr 2024
An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization
Minshuo Chen
Song Mei
Jianqing Fan
Mengdi Wang
VLM
MedIm
DiffM
32
48
0
11 Apr 2024
YaART: Yet Another ART Rendering Technology
Sergey Kastryulin
Artem Konev
Alexander Shishenya
Eugene Lyapustin
Artem Khurshudov
...
Dmitrii Kornilov
Mikhail Romanov
Artem Babenko
Sergei Ovcharenko
Valentin Khrulkov
EGVM
25
1
0
08 Apr 2024
UniFL: Improve Stable Diffusion via Unified Feedback Learning
Jiacheng Zhang
Jie Wu
Yuxi Ren
Xin Xia
Huafeng Kuang
...
Jiashi Li
Xuefeng Xiao
Min Zheng
Lean Fu
Guanbin Li
37
2
0
08 Apr 2024
Aligning Diffusion Models by Optimizing Human Utility
Shufan Li
Konstantinos Kallidromitis
Akash Gokul
Yusuke Kato
Kazuki Kozuka
103
27
0
06 Apr 2024
Idea-2-3D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
Junhao Chen
Xiang Li
Xiaojun Ye
Chao Li
Zhaoxin Fan
Hao Zhao
VGen
3DV
197
4
0
05 Apr 2024
Pixel-wise RL on Diffusion Models: Reinforcement Learning from Rich Feedback
Mo Kordzanganeh
Danial Keshvary
Nariman Arian
EGVM
16
0
0
05 Apr 2024
Identity Decoupling for Multi-Subject Personalization of Text-to-Image Models
Sang-Sub Jang
Jaehyeong Jo
Kimin Lee
Sung Ju Hwang
21
15
0
05 Apr 2024
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
Dongzhi Jiang
Guanglu Song
Xiaoshi Wu
Renrui Zhang
Dazhong Shen
Zhuofan Zong
Yu Liu
Hongsheng Li
VLM
30
20
0
04 Apr 2024
Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models
Kyuyoung Kim
Jongheon Jeong
Minyong An
Mohammad Ghavamzadeh
Krishnamurthy Dvijotham
Jinwoo Shin
Kimin Lee
EGVM
29
6
0
02 Apr 2024
TextCraftor: Your Text Encoder Can be Image Quality Controller
Yanyu Li
Xian Liu
Anil Kag
Ju Hu
Yerlan Idelbayev
Dhritiman Sagar
Yanzhi Wang
Sergey Tulyakov
Jian Ren
34
13
0
27 Mar 2024
VersaT2I: Improving Text-to-Image Models with Versatile Reward
Jianshu Guo
Wenhao Chai
Jie Deng
Hsiang-Wei Huang
Tianbo Ye
Yichen Xu
Jiawei Zhang
Jenq-Neng Hwang
Gaoang Wang
VLM
33
15
0
27 Mar 2024
Antigen-Specific Antibody Design via Direct Energy-based Preference Optimization
Xiangxin Zhou
Dongyu Xue
Ruizhe Chen
Zaixiang Zheng
Liang Wang
Quanquan Gu
DiffM
33
19
0
25 Mar 2024
Selectively Informative Description can Reduce Undesired Embedding Entanglements in Text-to-Image Personalization
Jimyeong Kim
Jungwon Park
Wonjong Rhee
DiffM
19
5
0
22 Mar 2024
DreamReward: Text-to-3D Generation with Human Preference
Junliang Ye
Fangfu Liu
Qixiu Li
Zhengyi Wang
Yikai Wang
Xinzhou Wang
Yueqi Duan
Jun Zhu
66
20
0
21 Mar 2024
MyVLM: Personalizing VLMs for User-Specific Queries
Yuval Alaluf
Elad Richardson
Sergey Tulyakov
Kfir Aberman
Daniel Cohen-Or
MLLM
VLM
26
18
0
21 Mar 2024
AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation
Jingkun An
Yinghao Zhu
Zongjian Li
Haoran Feng
Bohua Chen
Yemin Shi
Chengwei Pan
21
2
0
20 Mar 2024
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Brandon McKinzie
Zhe Gan
J. Fauconnier
Sam Dodge
Bowen Zhang
...
Zirui Wang
Ruoming Pang
Peter Grasch
Alexander Toshev
Yinfei Yang
MLLM
27
185
0
14 Mar 2024
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
Jialu Li
Jaemin Cho
Yi-Lin Sung
Jaehong Yoon
Mohit Bansal
MoMe
DiffM
34
8
0
11 Mar 2024
Advancing Text-Driven Chest X-Ray Generation with Policy-Based Reinforcement Learning
Woojung Han
Chanyoung Kim
Dayun Ju
Yumin Shim
Seong Jae Hwang
MedIm
19
8
0
11 Mar 2024
Fine-tuning of diffusion models via stochastic control: entropy regularization and beyond
Wenpin Tang
33
13
0
10 Mar 2024
Stabilizing Policy Gradients for Stochastic Differential Equations via Consistency with Perturbation Process
Xiangxin Zhou
Liang Wang
Yichi Zhou
DiffM
21
4
0
07 Mar 2024
Reconciling Reality through Simulation: A Real-to-Sim-to-Real Approach for Robust Manipulation
M. Torné
Anthony Simeonov
Zechu Li
April Chan
Tao Chen
Abhishek Gupta
Pulkit Agrawal
42
51
0
06 Mar 2024
SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model
Bin Cao
Jianhao Yuan
Yexin Liu
Jian Li
Shuyang Sun
Jing Liu
Bo-Lu Zhao
DiffM
27
7
0
28 Feb 2024
On the Challenges and Opportunities in Generative AI
Laura Manduchi
Kushagra Pandey
Robert Bamler
Ryan Cotterell
Sina Daubener
...
F. Wenzel
Frank Wood
Stephan Mandt
Vincent Fortuin
54
17
0
28 Feb 2024
Feedback Efficient Online Fine-Tuning of Diffusion Models
Masatoshi Uehara
Yulai Zhao
Kevin Black
Ehsan Hajiramezanali
Gabriele Scalia
N. Diamant
Alex Tseng
Sergey Levine
Tommaso Biancalani
22
21
0
26 Feb 2024
Graph Diffusion Policy Optimization
Yijing Liu
Chao Du
Tianyu Pang
Chongxuan Li
Wei Chen
Min-Bin Lin
24
7
0
26 Feb 2024
Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control
Masatoshi Uehara
Yulai Zhao
Kevin Black
Ehsan Hajiramezanali
Gabriele Scalia
N. Diamant
Alex Tseng
Tommaso Biancalani
Sergey Levine
24
42
0
23 Feb 2024
Discrete Probabilistic Inference as Control in Multi-path Environments
T. Deleu
Padideh Nouri
Nikolay Malkin
Doina Precup
Yoshua Bengio
109
28
0
15 Feb 2024
Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation
Huizhuo Yuan
Zixiang Chen
Kaixuan Ji
Quanquan Gu
55
24
0
15 Feb 2024
Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment
Rui Yang
Xiaoman Pan
Feng Luo
Shuang Qiu
Han Zhong
Dong Yu
Jianshu Chen
92
65
0
15 Feb 2024
PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models
Fei Deng
Qifei Wang
Wei Wei
Matthias Grundmann
Tingbo Hou
EGVM
17
15
0
13 Feb 2024
Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases
Ziyi Zhang
Sen Zhang
Yibing Zhan
Yong Luo
Yonggang Wen
Dacheng Tao
EGVM
33
8
0
13 Feb 2024
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
Shentao Yang
Tianqi Chen
Mingyuan Zhou
EGVM
30
22
0
13 Feb 2024
Score-based Diffusion Models via Stochastic Differential Equations -- a Technical Tutorial
Wenpin Tang
Hanyang Zhao
DiffM
33
23
0
12 Feb 2024
Implicit Diffusion: Efficient Optimization through Stochastic Sampling
Pierre Marion
Anna Korba
Peter Bartlett
Mathieu Blondel
Valentin De Bortoli
Arnaud Doucet
Felipe Llinares-López
Courtney Paquette
Quentin Berthet
74
11
0
08 Feb 2024
DITTO: Diffusion Inference-Time T-Optimization for Music Generation
Zachary Novack
Julian McAuley
Taylor Berg-Kirkpatrick
Nicholas J. Bryan
DiffM
13
32
0
22 Jan 2024
DiffusionGPT: LLM-Driven Text-to-Image Generation System
Jie Qin
Jie Wu
Weifeng Chen
Yuxi Ren
Huixian Li
Hefeng Wu
Xuefeng Xiao
Rui Wang
S. Wen
DiffM
50
22
0
18 Jan 2024
A New Creative Generation Pipeline for Click-Through Rate with Stable Diffusion Model
Hao Yang
Jianxin Yuan
Shuai Yang
Linhe Xu
Shuo Yuan
Yifan Zeng
15
11
0
17 Jan 2024
Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation
Seung Hyun Lee
Yinxiao Li
Junjie Ke
Innfarn Yoo
Han Zhang
...
Junfeng He
Gang Li
Sangpil Kim
Irfan Essa
Feng Yang
EGVM
21
18
0
11 Jan 2024
Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training
Xinyan Chen
Jiaxin Ge
Tianjun Zhang
Jiaming Liu
Shanghang Zhang
VLM
EGVM
27
0
0
23 Dec 2023