arXiv:2302.08560
Dual RL: Unification and New Methods for Reinforcement and Imitation Learning
16 February 2023
Harshit S. Sikchi
Qinqing Zheng
Amy Zhang
S. Niekum
OffRL
Papers citing "Dual RL: Unification and New Methods for Reinforcement and Imitation Learning"
19 / 19 papers shown
Title
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
Haoran Xu
Shuozhe Li
Harshit S. Sikchi
S. Niekum
Amy Zhang
OffRL
25
0
0
17 Apr 2025
A Clean Slate for Offline Reinforcement Learning
Matthew Jackson
Uljad Berdica
Jarek Liesen
Shimon Whiteson
Jakob Foerster
OffRL
OnRL
47
0
0
15 Apr 2025
Rethinking Inverse Reinforcement Learning: from Data Alignment to Task Alignment
Weichao Zhou
Wenchao Li
26
0
0
31 Oct 2024
Diffusing States and Matching Scores: A New Framework for Imitation Learning
Runzhe Wu
Yiding Chen
Gokul Swamy
Kianté Brantley
Wen Sun
DiffM
37
3
0
17 Oct 2024
Imitating Language via Scalable Inverse Reinforcement Learning
Markus Wulfmeier
Michael Bloesch
Nino Vieillard
Arun Ahuja
Jorg Bornschein
...
Jost Tobias Springenberg
Nikola Momchev
Olivier Bachem
Matthieu Geist
Martin Riedmiller
34
9
0
02 Sep 2024
CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk
Mohamad Fares El Hajj Chehade
Amrit Singh Bedi
Amy Zhang
Hao Zhu
OffRL
AAML
41
0
0
16 Aug 2024
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning
Liyuan Mao
Haoran Xu
Weinan Zhang
Xianyuan Zhan
Amy Zhang
OffRL
36
5
0
29 Jul 2024
Is Value Learning Really the Main Bottleneck in Offline RL?
Seohong Park
Kevin Frans
Sergey Levine
Aviral Kumar
OffRL
43
7
0
13 Jun 2024
A Dual Approach to Imitation Learning from Observations with Offline Datasets
Harshit S. Sikchi
Caleb Chuck
Amy Zhang
S. Niekum
OffRL
25
4
0
13 Jun 2024
Scaling Laws for Reward Model Overoptimization in Direct Alignment Algorithms
Rafael Rafailov
Yaswanth Chittepu
Ryan Park
Harshit S. Sikchi
Joey Hejna
Bradley Knox
Chelsea Finn
S. Niekum
50
47
0
05 Jun 2024
A Unified Linear Programming Framework for Offline Reward Learning from Human Demonstrations and Feedback
Kihyun Kim
Jiawei Zhang
Asuman Ozdaglar
P. Parrilo
OffRL
33
1
0
20 May 2024
Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning
Caleb Chuck
Carl Qi
M. Munje
Shuozhe Li
Max Rudolph
...
Kavan Mehta
Anthony Wang
Peter Stone
Amy Zhang
S. Niekum
28
5
0
06 May 2024
A Distributional Analogue to the Successor Representation
Harley Wiltzer
Jesse Farebrother
Arthur Gretton
Yunhao Tang
André Barreto
Will Dabney
Marc G. Bellemare
Mark Rowland
36
5
0
13 Feb 2024
SMORE: Score Models for Offline Goal-Conditioned Reinforcement Learning
Harshit S. Sikchi
Rohan Chitnis
Ahmed Touati
A. Geramifard
Amy Zhang
S. Niekum
OffRL
31
6
0
03 Nov 2023
Contrastive Preference Learning: Learning from Human Feedback without RL
Joey Hejna
Rafael Rafailov
Harshit S. Sikchi
Chelsea Finn
S. Niekum
W. B. Knox
Dorsa Sadigh
OffRL
19
50
0
20 Oct 2023
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
Mitsuhiko Nakamoto
Yuexiang Zhai
Anika Singh
Max Sobol Mark
Yi-An Ma
Chelsea Finn
Aviral Kumar
Sergey Levine
OffRL
OnRL
109
108
0
09 Mar 2023
Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping
Hao Sun
Lei Han
Rui Yang
Xiaoteng Ma
Jian Guo
Bolei Zhou
OffRL
OnRL
36
10
0
15 Sep 2022
Offline Reinforcement Learning with Implicit Q-Learning
Ilya Kostrikov
Ashvin Nair
Sergey Levine
OffRL
212
837
0
12 Oct 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
329
1,949
0
04 May 2020