Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.03493
Cited By
More Robust Doubly Robust Off-policy Evaluation
10 February 2018
Mehrdad Farajtabar
Yinlam Chow
Mohammad Ghavamzadeh
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"More Robust Doubly Robust Off-policy Evaluation"
17 / 67 papers shown
Title
BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning
Xinyue Chen
Zijian Zhou
Junyao Xing
Che Wang
Yanqiu Wu
Keith Ross
OffRL
35
121
0
27 Oct 2019
Large-scale Causal Approaches to Debiasing Post-click Conversion Rate Estimation with Multi-task Learning
Wenhao Zhang
Wentian Bao
Xiao-Yang Liu
Keping Yang
Quan Lin
Hong Wen
Ramin Ramezani
CML
29
104
0
16 Oct 2019
Adaptive Trade-Offs in Off-Policy Learning
Mark Rowland
Will Dabney
Rémi Munos
OffRL
25
22
0
16 Oct 2019
Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation
Ziyang Tang
Yihao Feng
Lihong Li
Dengyong Zhou
Qiang Liu
OffRL
30
67
0
16 Oct 2019
Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning
Nathan Kallus
Masatoshi Uehara
OffRL
26
88
0
12 Sep 2019
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes
Nathan Kallus
Masatoshi Uehara
OffRL
46
183
0
22 Aug 2019
Doubly-Robust Lasso Bandit
Gi-Soo Kim
M. Paik
24
61
0
26 Jul 2019
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog
Natasha Jaques
Asma Ghandeharioun
J. Shen
Craig Ferguson
Àgata Lapedriza
Noah J. Jones
S. Gu
Rosalind W. Picard
OffRL
45
337
0
30 Jun 2019
Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting
Aditya Grover
Jiaming Song
Alekh Agarwal
Kenneth Tran
Ashish Kapoor
Eric Horvitz
Stefano Ermon
26
123
0
23 Jun 2019
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
Ofir Nachum
Yinlam Chow
Bo Dai
Lihong Li
OffRL
13
328
0
10 Jun 2019
Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning
Nathan Kallus
Masatoshi Uehara
OffRL
30
54
0
09 Jun 2019
Balanced off-policy evaluation in general action spaces
A. Sondhi
David Arbour
Drew Dimmery
OffRL
29
17
0
09 Jun 2019
Learning When-to-Treat Policies
Xinkun Nie
Emma Brunskill
Stefan Wager
CML
OffRL
26
89
0
23 May 2019
Combining Parametric and Nonparametric Models for Off-Policy Evaluation
Omer Gottesman
Yao Liu
Scott Sussex
Emma Brunskill
Finale Doshi-Velez
OffRL
33
33
0
14 May 2019
CAB: Continuous Adaptive Blending Estimator for Policy Evaluation and Learning
Yi-Hsun Su
Lequn Wang
Michele Santacatterina
Mohsen Guizani
CML
OffRL
15
6
0
06 Nov 2018
Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration Matters
Aniruddh Raghu
Omer Gottesman
Yao Liu
Matthieu Komorowski
A. Faisal
Finale Doshi-Velez
Emma Brunskill
OffRL
33
33
0
03 Jul 2018
Representation Balancing MDPs for Off-Policy Policy Evaluation
Yao Liu
Omer Gottesman
Aniruddh Raghu
Matthieu Komorowski
A. Faisal
Finale Doshi-Velez
Emma Brunskill
OffRL
27
75
0
23 May 2018
Previous
1
2