Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2002.07729
Cited By
v1
v2 (latest)
Adaptive Estimator Selection for Off-Policy Evaluation
International Conference on Machine Learning (ICML), 2020
18 February 2020
Yi-Hsun Su
Pavithra Srinath
A. Krishnamurthy
OffRL
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Adaptive Estimator Selection for Off-Policy Evaluation"
35 / 35 papers shown
A General Framework for Off-Policy Learning with Partially-Observed Reward
International Conference on Learning Representations (ICLR), 2025
Rikiya Takehi
Masahiro Asami
K. Kawakami
Yuta Saito
OffRL
215
1
0
17 Jun 2025
Off-Policy Evaluation of Ranking Policies via Embedding-Space User Behavior Modeling
Tatsuki Takahashi
Chihiro Maru
Hiroko Shoji
OffRL
218
0
0
31 May 2025
Clustering Context in Off-Policy Evaluation
International Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Daniel Guzman-Olivares
Philipp Schmidt
Jacek Golebiowski
Artur Bekasov
CML
OffRL
217
2
0
28 Feb 2025
Off-Policy Selection for Initiating Human-Centric Experimental Design
Neural Information Processing Systems (NeurIPS), 2024
Ge Gao
Xi Yang
Qitong Gao
Song Ju
Miroslav Pajic
Min Chi
OffRL
342
0
0
26 Oct 2024
Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation
Neural Information Processing Systems (NeurIPS), 2024
Shreyas Chaudhari
Ameet Deshpande
Bruno Castro da Silva
Philip S. Thomas
OffRL
266
1
0
03 Oct 2024
Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits
ACM Conference on Recommender Systems (RecSys), 2024
Tatsuhiro Shimizu
Koichi Tanaka
Ren Kishimoto
Haruka Kiyohara
Masahiro Nomura
Yuta Saito
CML
OffRL
365
8
0
20 Aug 2024
AutoOPE: Automated Off-Policy Estimator Selection
Nicolò Felicioni
Michael Benigni
Maurizio Ferrari Dacrema
OffRL
217
2
0
26 Jun 2024
Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies
Haanvid Lee
Tri Wahyu Guntara
Jongmin Lee
Yung-Kyun Noh
Kee-Eung Kim
OffRL
291
3
0
29 May 2024
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators
Allen Nie
Yash Chandak
Christina J. Yuan
Anirudhan Badrinath
Yannis Flet-Berliac
Emma Brunskil
OffRL
293
4
0
27 May 2024
Cross-Validated Off-Policy Evaluation
Matej Cief
Branislav Kveton
Michal Kompan
OffRL
363
2
0
24 May 2024
POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition
Yuta Saito
Jihan Yao
Thorsten Joachims
OffRL
341
14
0
09 Feb 2024
Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction
Haruka Kiyohara
Masahiro Nomura
Yuta Saito
703
18
0
03 Feb 2024
When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
Vincent Liu
P. Nagarajan
Andrew Patterson
Martha White
OffRL
433
3
0
04 Dec 2023
Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy Evaluation
International Conference on Learning Representations (ICLR), 2023
Haruka Kiyohara
Ren Kishimoto
K. Kawakami
Ken Kobayashi
Kazuhide Nakata
Yuta Saito
OffRL
520
15
0
30 Nov 2023
SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation
Haruka Kiyohara
Ren Kishimoto
K. Kawakami
Ken Kobayashi
Kazuhide Nakata
Yuta Saito
OffRL
ELM
547
5
0
30 Nov 2023
Off-Policy Evaluation for Large Action Spaces via Policy Convolution
The Web Conference (WWW), 2023
Noveen Sachdeva
Lequn Wang
Dawen Liang
Nathan Kallus
Julian McAuley
OffRL
337
17
0
24 Oct 2023
Double Clipping: Less-Biased Variance Reduction in Off-Policy Evaluation
Jan Malte Lichtenberg
Alexander K. Buchholz
Giuseppe Di Benedetto
M. Ruffini
Ben London
OffRL
201
4
0
03 Sep 2023
Doubly Robust Estimator for Off-Policy Evaluation with Large Action Spaces
IEEE Symposium Series on Computational Intelligence (IEEE-SSCI), 2023
Tatsuhiro Shimizu
L. Forastiere
OffRL
262
1
0
07 Aug 2023
On (Normalised) Discounted Cumulative Gain as an Off-Policy Evaluation Metric for Top-
n
n
n
Recommendation
Knowledge Discovery and Data Mining (KDD), 2023
Olivier Jeunen
Ivan Potapov
Aleksei Ustimenko
ELM
OffRL
466
23
0
27 Jul 2023
Off-Policy Evaluation for Large Action Spaces via Conjunct Effect Modeling
International Conference on Machine Learning (ICML), 2023
Yuta Saito
Qingyang Ren
Thorsten Joachims
CML
OffRL
355
33
0
14 May 2023
Policy-Adaptive Estimator Selection for Off-Policy Evaluation
AAAI Conference on Artificial Intelligence (AAAI), 2022
Takuma Udagawa
Haruka Kiyohara
Yusuke Narita
Yuta Saito
Keisuke Tateno
OffRL
299
29
0
25 Nov 2022
Oracle Inequalities for Model Selection in Offline Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Jonathan Lee
George Tucker
Ofir Nachum
Bo Dai
Emma Brunskill
OffRL
391
14
0
03 Nov 2022
Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous Actions
Neural Information Processing Systems (NeurIPS), 2022
Haanvid Lee
Jongmin Lee
Yunseon Choi
Wonseok Jeon
Byung-Jun Lee
Yung-Kyun Noh
Kee-Eung Kim
OffRL
330
7
0
24 Oct 2022
Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data
Neural Information Processing Systems (NeurIPS), 2022
Allen Nie
Yannis Flet-Berliac
Deon R. Jordan
William Steenbergen
Emma Brunskill
OffRL
351
14
0
16 Oct 2022
Off-policy evaluation for learning-to-rank via interpolating the item-position model and the position-based model
Alexander K. Buchholz
Ben London
Giuseppe Di Benedetto
Thorsten Joachims
OffRL
206
2
0
15 Oct 2022
Off-Policy Evaluation for Large Action Spaces via Embeddings
International Conference on Machine Learning (ICML), 2022
Yuta Saito
Thorsten Joachims
OffRL
292
60
0
13 Feb 2022
Model Selection in Batch Policy Optimization
International Conference on Machine Learning (ICML), 2021
Jonathan Lee
George Tucker
Ofir Nachum
Bo Dai
OffRL
252
12
0
23 Dec 2021
Pessimistic Model Selection for Offline Deep Reinforcement Learning
Conference on Uncertainty in Artificial Intelligence (UAI), 2021
Chao-Han Huck Yang
Zhengling Qi
Yifan Cui
Pin-Yu Chen
OffRL
320
4
0
29 Nov 2021
Off-Policy Evaluation in Partially Observed Markov Decision Processes under Sequential Ignorability
Annals of Statistics (Ann. Stat.), 2021
Yupeng Tang
Seung-seob Lee
OffRL
410
30
0
24 Oct 2021
Evaluating the Robustness of Off-Policy Evaluation
ACM Conference on Recommender Systems (RecSys), 2021
Yuta Saito
Takuma Udagawa
Haruka Kiyohara
Kazuki Mogi
Yusuke Narita
Kei Tateno
ELM
OffRL
321
48
0
31 Aug 2021
Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning
Bogdan Mazoure
Paul Mineiro
Pavithra Srinath
R. S. Sedeh
Doina Precup
Adith Swaminathan
OffRL
268
5
0
01 Jun 2021
Deeply-Debiased Off-Policy Interval Estimation
International Conference on Machine Learning (ICML), 2021
C. Shi
Runzhe Wan
Victor Chernozhukov
R. Song
OffRL
270
43
0
10 May 2021
Optimal Mixture Weights for Off-Policy Evaluation with Multiple Behavior Policies
Jinlin Lai
Lixin Zou
Jiaxing Song
OffRL
97
1
0
29 Nov 2020
Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings
Neural Information Processing Systems (NeurIPS), 2020
Hengrui Cai
C. Shi
R. Song
Wenbin Lu
OffRL
397
16
0
29 Oct 2020
Optimal Off-Policy Evaluation from Multiple Logging Policies
Nathan Kallus
Yuta Saito
Masatoshi Uehara
OffRL
354
44
0
21 Oct 2020
1
Page 1 of 1