2003.11126
Cited By
Black-box Off-policy Estimation for Infinite-Horizon Reinforcement Learning
24 March 2020
Ali Mousavi
Lihong Li
Qiang Liu
Denny Zhou
OffRL
Papers citing "Black-box Off-policy Estimation for Infinite-Horizon Reinforcement Learning"
5 / 5 papers shown
Offline Reinforcement Learning with Soft Behavior Regularization
Haoran Xu
Xianyuan Zhan
Jianxiong Li
Honglei Yin
OffRL
26
31
0
14 Oct 2021
On Instrumental Variable Regression for Deep Offline Policy Evaluation
Yutian Chen
Liyuan Xu
Çağlar Gülçehre
T. Paine
A. Gretton
Nando de Freitas
Arnaud Doucet
OffRL
39
18
0
21 May 2021
Reliable Off-policy Evaluation for Reinforcement Learning
Jie Wang
Rui Gao
H. Zha
OffRL
22
11
0
08 Nov 2020
Off-policy Evaluation in Infinite-Horizon Reinforcement Learning with Latent Confounders
Andrew Bennett
Nathan Kallus
Lihong Li
Ali Mousavi
OffRL
35
43
0
27 Jul 2020
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
Shangtong Zhang
Bo Liu
Shimon Whiteson
23
38
0
22 Apr 2020