Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2002.07729
Cited By
Adaptive Estimator Selection for Off-Policy Evaluation
18 February 2020
Yi-Hsun Su
Pavithra Srinath
A. Krishnamurthy
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Adaptive Estimator Selection for Off-Policy Evaluation"
5 / 5 papers shown
Title
Safe and Efficient Off-Policy Reinforcement Learning
Rémi Munos
T. Stepleton
Anna Harutyunyan
Marc G. Bellemare
OffRL
130
611
0
08 Jun 2016
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
186
5,056
0
05 Jun 2016
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
Philip S. Thomas
Emma Brunskill
OffRL
276
573
0
04 Apr 2016
Doubly Robust Policy Evaluation and Optimization
Miroslav Dudík
D. Erhan
John Langford
Lihong Li
OffRL
145
285
0
10 Mar 2015
Bandwidth selection in kernel density estimation: Oracle inequalities and adaptive minimax optimality
A. Goldenshluger
O. Lepski
269
245
0
06 Sep 2010
1