ResearchTrend.AI
Semi-supervised reward learning for offline reinforcement learning

12 December 2020
Ksenia Konyushkova
Konrad Zolna
Y. Aytar
Alexander Novikov
Scott E. Reed
Serkan Cabi
Nando de Freitas
    SSL
    OffRL

Papers citing "Semi-supervised reward learning for offline reinforcement learning"

20 / 20 papers shown
Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation
Teli Ma
Jiaming Zhou
Zifan Wang
Ronghe Qiu
Junwei Liang
40
8
0
14 Jun 2024
Multi-Fidelity Reinforcement Learning for Time-Optimal Quadrotor Re-planning
Gilhyun Ryou
Geoffrey Wang
S. Karaman
32
3
0
13 Mar 2024
Transductive Reward Inference on Graph
B. Qu
Xiaofeng Cao
Qing-Wu Guo
Yi Chang
Ivor W. Tsang
Chengqi Zhang
OffRL
17
0
0
06 Feb 2024
Contrastive Example-Based Control
Kyle Hatch
Benjamin Eysenbach
Rafael Rafailov
Tianhe Yu
Ruslan Salakhutdinov
Sergey Levine
Chelsea Finn
OffRL
15
3
0
24 Jul 2023
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations
Anqi Li
Byron Boots
Ching-An Cheng
OffRL
11
16
0
30 Mar 2023
CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning
Sheng Yue
Guan-Bo Wang
Wei Shao
Zhaofeng Zhang
Sen Lin
Junkai Ren
Junshan Zhang
OffRL
15
20
0
09 Feb 2023
Training Robots to Evaluate Robots: Example-Based Interactive Reward Functions for Policy Learning
Kun-Yen Huang
E. Hu
Dinesh Jayaraman
OffRL
13
5
0
17 Dec 2022
Real World Offline Reinforcement Learning with Realistic Data Source
G. Zhou
Liyiming Ke
S. Srinivasa
Abhi Gupta
Aravind Rajeswaran
Vikash Kumar
OffRL
19
21
0
12 Oct 2022
Semi-supervised Batch Learning From Logged Data
Gholamali Aminian
Armin Behnamnia
R. Vega
Laura Toni
Chengchun Shi
Hamid R. Rabiee
Omar Rivasplata
Miguel R. D. Rodrigues
OffRL
11
0
0
15 Sep 2022
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations
Haoran Xu
Xianyuan Zhan
Honglei Yin
Huiling Qin
OffRL
8
65
0
20 Jul 2022
MLGOPerf: An ML Guided Inliner to Optimize Performance
Amir H. Ashouri
Mostafa Elhoushi
Yu-Wei Hua
Xiang Wang
Muhammad Asif Manzoor
Bryan Chan
Yaoqing Gao
15
12
0
18 Jul 2022
Discriminator-Guided Model-Based Offline Imitation Learning
Wenjia Zhang
Haoran Xu
Haoyi Niu
Peng Cheng
Ming Li
Heming Zhang
Guyue Zhou
Xianyuan Zhan
OffRL
8
16
0
01 Jul 2022
Contrastive Learning as Goal-Conditioned Reinforcement Learning
Benjamin Eysenbach
Tianjun Zhang
Ruslan Salakhutdinov
Sergey Levine
SSL
OffRL
23
137
0
15 Jun 2022
Reinforcement Guided Multi-Task Learning Framework for Low-Resource Stereotype Detection
Rajkumar Pujari
Erik Oveson
Priyanka Kulkarni
E. Nouri
21
8
0
27 Mar 2022
How to Leverage Unlabeled Data in Offline Reinforcement Learning
Tianhe Yu
Aviral Kumar
Yevgen Chebotar
Karol Hausman
Chelsea Finn
Sergey Levine
OffRL
17
61
0
03 Feb 2022
Safe Deep RL in 3D Environments using Human Feedback
Matthew Rahtz
Vikrant Varma
Ramana Kumar
Zachary Kenton
Shane Legg
Jan Leike
16
4
0
20 Jan 2022
Offline Meta-Reinforcement Learning with Online Self-Supervision
Vitchyr H. Pong
Ashvin Nair
Laura M. Smith
Catherine Huang
Sergey Levine
OffRL
22
66
0
08 Jul 2021
Offline Inverse Reinforcement Learning
Firas Jarboui
Vianney Perchet
OffRL
11
13
0
09 Jun 2021
Replacing Rewards with Examples: Example-Based Policy Search via Recursive Classification
Benjamin Eysenbach
Sergey Levine
Ruslan Salakhutdinov
OffRL
13
50
0
23 Mar 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
321
1,662
0
04 May 2020