arXiv:2206.02000
Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning
4 June 2022
Xuefeng Jin
Xu-Hui Liu
Shengyi Jiang
Yang Yu
OffRL
Papers citing
"Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning"
5 / 5 papers shown
SOPE: Spectrum of Off-Policy Estimators
C. J. Yuan
Yash Chandak
S. Giguere
Philip S. Thomas
S. Niekum
OffRL
37
5
0
06 Nov 2021
Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble
Gaon An
Seungyong Moon
Jang-Hyun Kim
Hyun Oh Song
OffRL
95
261
0
04 Oct 2021
COMBO: Conservative Offline Model-Based Policy Optimization
Tianhe Yu
Aviral Kumar
Rafael Rafailov
Aravind Rajeswaran
Sergey Levine
Chelsea Finn
OffRL
214
413
0
16 Feb 2021
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning
Rongjun Qin
Songyi Gao
Xingyuan Zhang
Zhen Xu
Shengkai Huang
Zewen Li
Weinan Zhang
Yang Yu
OffRL
132
76
0
01 Feb 2021
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Seyed Kamyar Seyed Ghasemipour
Dale Schuurmans
S. Gu
OffRL
209
119
0
21 Jul 2020