ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2305.13301
  4. Cited By
Training Diffusion Models with Reinforcement Learning

Training Diffusion Models with Reinforcement Learning

22 May 2023
Kevin Black
Michael Janner
Yilun Du
Ilya Kostrikov
Sergey Levine
    EGVM
ArXivPDFHTML

Papers citing "Training Diffusion Models with Reinforcement Learning"

44 / 244 papers shown
Title
Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion
  Models with RL Finetuning
Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning
Desai Xie
Jiahao Li
Hao Tan
Xin Sun
Zhixin Shu
Yi Zhou
Sai Bi
Soren Pirk
Arie E. Kaufman
21
8
0
21 Dec 2023
InstructVideo: Instructing Video Diffusion Models with Human Feedback
InstructVideo: Instructing Video Diffusion Models with Human Feedback
Hangjie Yuan
Shiwei Zhang
Xiang Wang
Yujie Wei
Tao Feng
Yining Pan
Yingya Zhang
Ziwei Liu
Samuel Albanie
Dong Ni
VGen
13
41
0
19 Dec 2023
An Invitation to Deep Reinforcement Learning
An Invitation to Deep Reinforcement Learning
Bernhard Jaeger
Andreas Geiger
OffRL
OOD
62
5
0
13 Dec 2023
RL Dreams: Policy Gradient Optimization for Score Distillation based 3D
  Generation
RL Dreams: Policy Gradient Optimization for Score Distillation based 3D Generation
Aradhya Neeraj Mathur
Phu-Cuong Pham
Aniket Bera
Ojaswa Sharma
17
0
0
08 Dec 2023
iDesigner: A High-Resolution and Complex-Prompt Following Text-to-Image
  Diffusion Model for Interior Design
iDesigner: A High-Resolution and Complex-Prompt Following Text-to-Image Diffusion Model for Interior Design
Ruyi Gan
Xiaojun Wu
Junyu Lu
Yuanhe Tian
Di Zhang
...
Renliang Sun
Chang Liu
Jiaxing Zhang
Pingjian Zhang
Yan Song
44
4
0
07 Dec 2023
TokenCompose: Text-to-Image Diffusion with Token-level Supervision
TokenCompose: Text-to-Image Diffusion with Token-level Supervision
Zirui Wang
Zhizhou Sha
Zheng Ding
Yilin Wang
Zhuowen Tu
DiffM
16
21
0
06 Dec 2023
Generalized Contrastive Divergence: Joint Training of Energy-Based Model
  and Diffusion Model through Inverse Reinforcement Learning
Generalized Contrastive Divergence: Joint Training of Energy-Based Model and Diffusion Model through Inverse Reinforcement Learning
Sangwoong Yoon
Dohyun Kwon
Himchan Hwang
Yung-Kyun Noh
Frank C. Park
25
0
0
06 Dec 2023
InstructBooth: Instruction-following Personalized Text-to-Image
  Generation
InstructBooth: Instruction-following Personalized Text-to-Image Generation
Daewon Chae
Nokyung Park
Jinkyu Kim
Kimin Lee
DiffM
14
11
0
04 Dec 2023
LVDiffusor: Distilling Functional Rearrangement Priors from Large Models
  into Diffusor
LVDiffusor: Distilling Functional Rearrangement Priors from Large Models into Diffusor
Yiming Zeng
Mingdong Wu
Long Yang
Jiyao Zhang
Hao Ding
Hui Cheng
Hao Dong
DiffM
11
8
0
03 Dec 2023
OPERA: Alleviating Hallucination in Multi-Modal Large Language Models
  via Over-Trust Penalty and Retrospection-Allocation
OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation
Qidong Huang
Xiao-wen Dong
Pan Zhang
Bin Wang
Conghui He
Jiaqi Wang
Dahua Lin
Weiming Zhang
Neng H. Yu
MLLM
26
165
0
29 Nov 2023
Enhancing Diffusion Models with Text-Encoder Reinforcement Learning
Enhancing Diffusion Models with Text-Encoder Reinforcement Learning
Chaofeng Chen
Annan Wang
Haoning Wu
Liang Liao
Wenxiu Sun
Qiong Yan
Weisi Lin
15
9
0
27 Nov 2023
Reinforcement Learning from Diffusion Feedback: Q* for Image Search
Reinforcement Learning from Diffusion Feedback: Q* for Image Search
Aboli Rajan Marathe
VLM
37
0
0
27 Nov 2023
Using Human Feedback to Fine-tune Diffusion Models without Any Reward
  Model
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model
Kai Yang
Jian Tao
Jiafei Lyu
Chunjiang Ge
Jiaxin Chen
Qimai Li
Weihan Shen
Xiaolong Zhu
Xiu Li
EGVM
16
87
0
22 Nov 2023
Diffusion Model Alignment Using Direct Preference Optimization
Diffusion Model Alignment Using Direct Preference Optimization
Bram Wallace
Meihua Dang
Rafael Rafailov
Linqi Zhou
Aaron Lou
Senthil Purushwalkam
Stefano Ermon
Caiming Xiong
Shafiq R. Joty
Nikhil Naik
EGVM
16
220
0
21 Nov 2023
Behavior Optimized Image Generation
Behavior Optimized Image Generation
Varun Khurana
Yaman Kumar Singla
J. Subramanian
R. Shah
Changyou Chen
Zhiqiang Xu
Balaji Krishnamurthy
EGVM
8
4
0
18 Nov 2023
Posterior Sampling with Delayed Feedback for Reinforcement Learning with
  Linear Function Approximation
Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation
Nikki Lijing Kuang
Ming Yin
Mengdi Wang
Yu-Xiang Wang
Yian Ma
11
6
0
29 Oct 2023
Contrastive Preference Learning: Learning from Human Feedback without RL
Contrastive Preference Learning: Learning from Human Feedback without RL
Joey Hejna
Rafael Rafailov
Harshit S. Sikchi
Chelsea Finn
S. Niekum
W. B. Knox
Dorsa Sadigh
OffRL
13
49
0
20 Oct 2023
Video Language Planning
Video Language Planning
Yilun Du
Mengjiao Yang
Peter R. Florence
Fei Xia
Ayzaan Wahid
...
Pieter Abbeel
Josh Tenenbaum
L. Kaelbling
Andy Zeng
Jonathan Tompson
PINN
LM&Ro
89
83
0
16 Oct 2023
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic
  Image Design and Generation
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Zhengyuan Yang
Jianfeng Wang
Linjie Li
Kevin Qinghong Lin
Chung-Ching Lin
Zicheng Liu
Lijuan Wang
LRM
MLLM
DiffM
13
22
0
12 Oct 2023
OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation
OpenLEAF: Open-Domain Interleaved Image-Text Generation and Evaluation
Jie An
Zhengyuan Yang
Linjie Li
Jianfeng Wang
K. Lin
Zicheng Liu
Lijuan Wang
Jiebo Luo
14
11
0
11 Oct 2023
EasyPhoto: Your Smart AI Photo Generator
EasyPhoto: Your Smart AI Photo Generator
Ziheng Wu
Jiaqi Xu
Xinyi Zou
Kunzhe Huang
Xing Shi
Jun Huang
11
4
0
07 Oct 2023
Improved Baselines with Visual Instruction Tuning
Improved Baselines with Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Yuheng Li
Yong Jae Lee
VLM
MLLM
22
2,400
0
05 Oct 2023
Aligning Text-to-Image Diffusion Models with Reward Backpropagation
Aligning Text-to-Image Diffusion Models with Reward Backpropagation
Mihir Prabhudesai
Anirudh Goyal
Deepak Pathak
Katerina Fragkiadaki
19
108
0
05 Oct 2023
Transcending Domains through Text-to-Image Diffusion: A Source-Free Approach to Domain Adaptation
Shivang Chopra
Suraj Kothawade
Houda Aynaou
Aman Chadha
DiffM
19
0
0
02 Oct 2023
Directly Fine-Tuning Diffusion Models on Differentiable Rewards
Directly Fine-Tuning Diffusion Models on Differentiable Rewards
Amita Gajewar
Paul Vicol
G. Bansal
David J Fleet
16
145
0
29 Sep 2023
RL-I2IT: Image-to-Image Translation with Deep Reinforcement Learning
RL-I2IT: Image-to-Image Translation with Deep Reinforcement Learning
Xin Wang
Ziwei Luo
Jing Hu
Chengmin Feng
Shu Hu
Bin Zhu
Xi Wu
Hongtu Zhu
Xin Li
Siwei Lyu
OffRL
VLM
27
1
0
24 Sep 2023
Diffusion-EDFs: Bi-equivariant Denoising Generative Modeling on SE(3)
  for Visual Robotic Manipulation
Diffusion-EDFs: Bi-equivariant Denoising Generative Modeling on SE(3) for Visual Robotic Manipulation
Hyunwoo Ryu
Jiwoo Kim
Hyun Seok Ahn
Junwoo Chang
Joohwan Seo
Taehan Kim
Yubin Kim
Chaewon Hwang
Jongeun Choi
R. Horowitz
DiffM
16
33
0
06 Sep 2023
Reinforcement Learning with Human Feedback for Realistic Traffic
  Simulation
Reinforcement Learning with Human Feedback for Realistic Traffic Simulation
Yulong Cao
B. Ivanovic
Chaowei Xiao
Marco Pavone
11
14
0
01 Sep 2023
Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning
  Based on Visually Grounded Conversations
Affective Visual Dialog: A Large-Scale Benchmark for Emotional Reasoning Based on Visually Grounded Conversations
Kilichbek Haydarov
Xiaoqian Shen
Avinash Madasu
Mahmoud Salem
Jia Li
Gamaleldin F. Elsayed
Mohamed Elhoseiny
28
4
0
30 Aug 2023
Reinforcement Learning for Generative AI: A Survey
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
42
10
0
28 Aug 2023
A Survey of Diffusion Based Image Generation Models: Issues and Their
  Solutions
A Survey of Diffusion Based Image Generation Models: Issues and Their Solutions
Tianyi Zhang
Zheng Wang
Jin Huang
M. M. Tasnim
Wei Shi
VLM
11
21
0
25 Aug 2023
Manipulating Embeddings of Stable Diffusion Prompts
Manipulating Embeddings of Stable Diffusion Prompts
Niklas Deckers
Julia Peters
Martin Potthast
DiffM
32
9
0
23 Aug 2023
Reinforcement Learning for Generative AI: State of the Art,
  Opportunities and Open Research Challenges
Reinforcement Learning for Generative AI: State of the Art, Opportunities and Open Research Challenges
Giorgio Franceschelli
Mirco Musolesi
AI4CE
27
19
0
31 Jul 2023
On the Design Fundamentals of Diffusion Models: A Survey
On the Design Fundamentals of Diffusion Models: A Survey
Ziyi Chang
G. Koulieris
Hubert P. H. Shum
DiffM
27
50
0
07 Jun 2023
DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion
  Models
DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models
Ying Fan
Olivia Watkins
Yuqing Du
Hao Liu
Moonkyung Ryu
Craig Boutilier
Pieter Abbeel
Mohammad Ghavamzadeh
Kangwook Lee
Kimin Lee
28
133
0
25 May 2023
Learning to Evaluate the Artness of AI-generated Images
Learning to Evaluate the Artness of AI-generated Images
Junyu Chen
Jie An
Hanjia Lyu
Christopher Kanan
Jiebo Luo
EGVM
19
11
0
08 May 2023
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment
Hanze Dong
Wei Xiong
Deepanshu Goyal
Yihan Zhang
Winnie Chow
Rui Pan
Shizhe Diao
Jipeng Zhang
Kashun Shum
Tong Zhang
ALM
6
397
0
13 Apr 2023
Planning with Diffusion for Flexible Behavior Synthesis
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
202
622
0
20 May 2022
Teaching language models to support answers with verified quotes
Teaching language models to support answers with verified quotes
Jacob Menick
Maja Trebacz
Vladimir Mikulik
John Aslanides
Francis Song
...
Mia Glaese
Susannah Young
Lucy Campbell-Gillingham
G. Irving
Nat McAleese
ELM
RALM
229
255
0
21 Mar 2022
Training language models to follow instructions with human feedback
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
303
11,730
0
04 Mar 2022
Crystal Diffusion Variational Autoencoder for Periodic Material
  Generation
Crystal Diffusion Variational Autoencoder for Periodic Material Generation
Tian Xie
Xiang Fu
O. Ganea
Regina Barzilay
Tommi Jaakkola
DiffM
BDL
204
224
0
12 Oct 2021
Creativity and Machine Learning: A Survey
Creativity and Machine Learning: A Survey
Giorgio Franceschelli
Mirco Musolesi
VLM
AI4CE
19
37
0
06 Apr 2021
Zero-Shot Text-to-Image Generation
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
253
4,735
0
24 Feb 2021
Fine-Tuning Language Models from Human Preferences
Fine-Tuning Language Models from Human Preferences
Daniel M. Ziegler
Nisan Stiennon
Jeff Wu
Tom B. Brown
Alec Radford
Dario Amodei
Paul Christiano
G. Irving
ALM
275
1,561
0
18 Sep 2019
Previous
12345