2404.04920
Cited By
Regularized Conditional Diffusion Model for Multi-Task Preference Alignment
7 April 2024
Xudong Yu
Chenjia Bai
Haoran He
Changhong Wang
Xuelong Li
Papers citing
"Regularized Conditional Diffusion Model for Multi-Task Preference Alignment"
8 / 8 papers shown
Robust Multi-Objective Controlled Decoding of Large Language Models
Seongho Son
William Bankes
Sangwoong Yoon
Shyam Sundhar Ramesh
Xiaohang Tang
Ilija Bogunovic
39
0
0
11 Mar 2025
Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner
Chenyou Fan
Chenjia Bai
Zhao Shan
Haoran He
Yang Zhang
Zhen Wang
28
3
0
30 Sep 2024
Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
Zhao Shan
Chenyou Fan
Shuang Qiu
Jiyuan Shi
Chenjia Bai
33
4
0
09 Sep 2024
Diffusion Models are Minimax Optimal Distribution Estimators
Kazusato Oko
Shunta Akiyama
Taiji Suzuki
DiffM
61
84
0
03 Mar 2023
Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling
Huayu Chen
Cheng Lu
Chengyang Ying
Hang Su
Jun Zhu
DiffM
OffRL
79
103
0
29 Sep 2022
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
202
622
0
20 May 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
301
11,730
0
04 Mar 2022
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
206
832
0
12 Oct 2021