Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1604.00923
Cited By
Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning
4 April 2016
Philip S. Thomas
Emma Brunskill
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning"
11 / 11 papers shown
Title
DOLCE: Decomposing Off-Policy Evaluation/Learning into Lagged and Current Effects
Shu Tamano
Masanori Nojima
OffRL
111
0
0
02 May 2025
Statistical Inference in Reinforcement Learning: A Selective Survey
Chengchun Shi
OffRL
151
2
0
22 Feb 2025
Doubly Optimal Policy Evaluation for Reinforcement Learning
Shuze Liu
Claire Chen
Shangtong Zhang
OffRL
109
3
0
03 Oct 2024
Evaluation of Active Feature Acquisition Methods for Time-varying Feature Settings
Henrik von Kleist
Alireza Zamanian
I. Shpitser
Narges Ahmidi
OffRL
119
2
0
03 Dec 2023
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
OffRL
90
74
0
17 Aug 2020
Off-policy Bandits with Deficient Support
Noveen Sachdeva
Yi-Hsun Su
Thorsten Joachims
OffRL
93
75
0
16 Jun 2020
Policy Learning with Observational Data
Susan Athey
Stefan Wager
CML
OffRL
156
182
0
09 Feb 2017
A Notation for Markov Decision Processes
Philip S. Thomas
Billy Okal
57
17
0
30 Dec 2015
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Nan Jiang
Lihong Li
OffRL
113
621
0
11 Nov 2015
Emphatic Temporal-Difference Learning
A. R. Mahmood
Huizhen Yu
Martha White
R. Sutton
75
33
0
06 Jul 2015
Doubly Robust Policy Evaluation and Learning
Miroslav Dudík
John Langford
Lihong Li
OffRL
118
694
0
23 Mar 2011
1