2012.06899
Cited By
Semi-supervised reward learning for offline reinforcement learning
12 December 2020
Ksenia Konyushkova
Konrad Zolna
Y. Aytar
Alexander Novikov
Scott E. Reed
Serkan Cabi
Nando de Freitas
SSL
OffRL
Papers citing "Semi-supervised reward learning for offline reinforcement learning"
20 / 20 papers shown
Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation
Teli Ma
Jiaming Zhou
Zifan Wang
Ronghe Qiu
Junwei Liang
35
8
0
14 Jun 2024
Multi-Fidelity Reinforcement Learning for Time-Optimal Quadrotor Re-planning
Gilhyun Ryou
Geoffrey Wang
S. Karaman
27
3
0
13 Mar 2024
Transductive Reward Inference on Graph
B. Qu
Xiaofeng Cao
Qing-Wu Guo
Yi Chang
Ivor W. Tsang
Chengqi Zhang
OffRL
12
0
0
06 Feb 2024
Contrastive Example-Based Control
Kyle Hatch
Benjamin Eysenbach
Rafael Rafailov
Tianhe Yu
Ruslan Salakhutdinov
Sergey Levine
Chelsea Finn
OffRL
15
3
0
24 Jul 2023
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
Anqi Li
Byron Boots
Ching-An Cheng
OffRL
6
16
0
30 Mar 2023
CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning
Sheng Yue
Guan-Bo Wang
Wei Shao
Zhaofeng Zhang
Sen Lin
Junkai Ren
Junshan Zhang
OffRL
15
20
0
09 Feb 2023
Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning
Kun-Yen Huang
E. Hu
Dinesh Jayaraman
OffRL
8
5
0
17 Dec 2022
Real World Offline Reinforcement Learning with Realistic Data Source
G. Zhou
Liyiming Ke
S. Srinivasa
Abhi Gupta
Aravind Rajeswaran
Vikash Kumar
OffRL
12
21
0
12 Oct 2022
Semi-supervised Batch Learning From Logged Data
Gholamali Aminian
Armin Behnamnia
R. Vega
Laura Toni
Chengchun Shi
Hamid R. Rabiee
Omar Rivasplata
Miguel R. D. Rodrigues
OffRL
6
0
0
15 Sep 2022
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations
Haoran Xu
Xianyuan Zhan
Honglei Yin
Huiling Qin
OffRL
8
65
0
20 Jul 2022
MLGOPerf: An ML Guided Inliner to Optimize Performance
Amir H. Ashouri
Mostafa Elhoushi
Yu-Wei Hua
Xiang Wang
Muhammad Asif Manzoor
Bryan Chan
Yaoqing Gao
10
12
0
18 Jul 2022
Discriminator-Guided Model-Based Offline Imitation Learning
Wenjia Zhang
Haoran Xu
Haoyi Niu
Peng Cheng
Ming Li
Heming Zhang
Guyue Zhou
Xianyuan Zhan
OffRL
8
16
0
01 Jul 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
18
137
0
15 Jun 2022
Reinforcement Guided Multi-Task Learning Framework for Low-Resource Stereotype Detection
Rajkumar Pujari
Erik Oveson
Priyanka Kulkarni
E. Nouri
16
8
0
27 Mar 2022
How to Leverage Unlabeled Data in Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
17
61
0
03 Feb 2022
Safe Deep RL in 3D Environments using Human Feedback
Matthew Rahtz
Vikrant Varma
Ramana Kumar
Zachary Kenton
Shane Legg
Jan Leike
11
4
0
20 Jan 2022
Offline Meta-Reinforcement Learning with Online Self-Supervision
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
15
66
0
08 Jul 2021
Offline Inverse Reinforcement Learning
Firas Jarboui
Vianney Perchet
OffRL
11
13
0
09 Jun 2021
Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
8
50
0
23 Mar 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
321
1,662
0
04 May 2020