ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2002.07729
  4. Cited By
Adaptive Estimator Selection for Off-Policy Evaluation
v1v2 (latest)

Adaptive Estimator Selection for Off-Policy Evaluation

International Conference on Machine Learning (ICML), 2020
18 February 2020
Yi-Hsun Su
Pavithra Srinath
A. Krishnamurthy
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Adaptive Estimator Selection for Off-Policy Evaluation"

35 / 35 papers shown
A General Framework for Off-Policy Learning with Partially-Observed Reward
A General Framework for Off-Policy Learning with Partially-Observed RewardInternational Conference on Learning Representations (ICLR), 2025
Rikiya Takehi
Masahiro Asami
K. Kawakami
Yuta Saito
OffRL
215
1
0
17 Jun 2025
Off-Policy Evaluation of Ranking Policies via Embedding-Space User Behavior Modeling
Off-Policy Evaluation of Ranking Policies via Embedding-Space User Behavior Modeling
Tatsuki Takahashi
Chihiro Maru
Hiroko Shoji
OffRL
218
0
0
31 May 2025
Clustering Context in Off-Policy Evaluation
Clustering Context in Off-Policy EvaluationInternational Conference on Artificial Intelligence and Statistics (AISTATS), 2025
Daniel Guzman-Olivares
Philipp Schmidt
Jacek Golebiowski
Artur Bekasov
CMLOffRL
217
2
0
28 Feb 2025
Off-Policy Selection for Initiating Human-Centric Experimental Design
Off-Policy Selection for Initiating Human-Centric Experimental DesignNeural Information Processing Systems (NeurIPS), 2024
Ge Gao
Xi Yang
Qitong Gao
Song Ju
Miroslav Pajic
Min Chi
OffRL
342
0
0
26 Oct 2024
Abstract Reward Processes: Leveraging State Abstraction for Consistent
  Off-Policy Evaluation
Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy EvaluationNeural Information Processing Systems (NeurIPS), 2024
Shreyas Chaudhari
Ameet Deshpande
Bruno Castro da Silva
Philip S. Thomas
OffRL
266
1
0
03 Oct 2024
Effective Off-Policy Evaluation and Learning in Contextual Combinatorial
  Bandits
Effective Off-Policy Evaluation and Learning in Contextual Combinatorial BanditsACM Conference on Recommender Systems (RecSys), 2024
Tatsuhiro Shimizu
Koichi Tanaka
Ren Kishimoto
Haruka Kiyohara
Masahiro Nomura
Yuta Saito
CMLOffRL
365
8
0
20 Aug 2024
AutoOPE: Automated Off-Policy Estimator Selection
AutoOPE: Automated Off-Policy Estimator Selection
Nicolò Felicioni
Michael Benigni
Maurizio Ferrari Dacrema
OffRL
217
2
0
26 Jun 2024
Kernel Metric Learning for In-Sample Off-Policy Evaluation of
  Deterministic RL Policies
Kernel Metric Learning for In-Sample Off-Policy Evaluation of Deterministic RL Policies
Haanvid Lee
Tri Wahyu Guntara
Jongmin Lee
Yung-Kyun Noh
Kee-Eung Kim
OffRL
291
3
0
29 May 2024
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates
  of Multiple Estimators
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators
Allen Nie
Yash Chandak
Christina J. Yuan
Anirudhan Badrinath
Yannis Flet-Berliac
Emma Brunskil
OffRL
293
4
0
27 May 2024
Cross-Validated Off-Policy Evaluation
Cross-Validated Off-Policy Evaluation
Matej Cief
Branislav Kveton
Michal Kompan
OffRL
363
2
0
24 May 2024
POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy
  Decomposition
POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition
Yuta Saito
Jihan Yao
Thorsten Joachims
OffRL
341
14
0
09 Feb 2024
Off-Policy Evaluation of Slate Bandit Policies via Optimizing
  Abstraction
Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction
Haruka Kiyohara
Masahiro Nomura
Yuta Saito
703
18
0
03 Feb 2024
When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
Vincent Liu
P. Nagarajan
Andrew Patterson
Martha White
OffRL
433
3
0
04 Dec 2023
Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy
  Evaluation
Towards Assessing and Benchmarking Risk-Return Tradeoff of Off-Policy EvaluationInternational Conference on Learning Representations (ICLR), 2023
Haruka Kiyohara
Ren Kishimoto
K. Kawakami
Ken Kobayashi
Kazuhide Nakata
Yuta Saito
OffRL
520
15
0
30 Nov 2023
SCOPE-RL: A Python Library for Offline Reinforcement Learning and
  Off-Policy Evaluation
SCOPE-RL: A Python Library for Offline Reinforcement Learning and Off-Policy Evaluation
Haruka Kiyohara
Ren Kishimoto
K. Kawakami
Ken Kobayashi
Kazuhide Nakata
Yuta Saito
OffRLELM
547
5
0
30 Nov 2023
Off-Policy Evaluation for Large Action Spaces via Policy Convolution
Off-Policy Evaluation for Large Action Spaces via Policy ConvolutionThe Web Conference (WWW), 2023
Noveen Sachdeva
Lequn Wang
Dawen Liang
Nathan Kallus
Julian McAuley
OffRL
337
17
0
24 Oct 2023
Double Clipping: Less-Biased Variance Reduction in Off-Policy Evaluation
Double Clipping: Less-Biased Variance Reduction in Off-Policy Evaluation
Jan Malte Lichtenberg
Alexander K. Buchholz
Giuseppe Di Benedetto
M. Ruffini
Ben London
OffRL
201
4
0
03 Sep 2023
Doubly Robust Estimator for Off-Policy Evaluation with Large Action
  Spaces
Doubly Robust Estimator for Off-Policy Evaluation with Large Action SpacesIEEE Symposium Series on Computational Intelligence (IEEE-SSCI), 2023
Tatsuhiro Shimizu
L. Forastiere
OffRL
262
1
0
07 Aug 2023
On (Normalised) Discounted Cumulative Gain as an Off-Policy Evaluation
  Metric for Top-$n$ Recommendation
On (Normalised) Discounted Cumulative Gain as an Off-Policy Evaluation Metric for Top-nnn RecommendationKnowledge Discovery and Data Mining (KDD), 2023
Olivier Jeunen
Ivan Potapov
Aleksei Ustimenko
ELMOffRL
466
23
0
27 Jul 2023
Off-Policy Evaluation for Large Action Spaces via Conjunct Effect
  Modeling
Off-Policy Evaluation for Large Action Spaces via Conjunct Effect ModelingInternational Conference on Machine Learning (ICML), 2023
Yuta Saito
Qingyang Ren
Thorsten Joachims
CMLOffRL
355
33
0
14 May 2023
Policy-Adaptive Estimator Selection for Off-Policy Evaluation
Policy-Adaptive Estimator Selection for Off-Policy EvaluationAAAI Conference on Artificial Intelligence (AAAI), 2022
Takuma Udagawa
Haruka Kiyohara
Yusuke Narita
Yuta Saito
Keisuke Tateno
OffRL
299
29
0
25 Nov 2022
Oracle Inequalities for Model Selection in Offline Reinforcement
  Learning
Oracle Inequalities for Model Selection in Offline Reinforcement LearningNeural Information Processing Systems (NeurIPS), 2022
Jonathan Lee
George Tucker
Ofir Nachum
Bo Dai
Emma Brunskill
OffRL
391
14
0
03 Nov 2022
Local Metric Learning for Off-Policy Evaluation in Contextual Bandits
  with Continuous Actions
Local Metric Learning for Off-Policy Evaluation in Contextual Bandits with Continuous ActionsNeural Information Processing Systems (NeurIPS), 2022
Haanvid Lee
Jongmin Lee
Yunseon Choi
Wonseok Jeon
Byung-Jun Lee
Yung-Kyun Noh
Kee-Eung Kim
OffRL
330
7
0
24 Oct 2022
Data-Efficient Pipeline for Offline Reinforcement Learning with Limited
  Data
Data-Efficient Pipeline for Offline Reinforcement Learning with Limited DataNeural Information Processing Systems (NeurIPS), 2022
Allen Nie
Yannis Flet-Berliac
Deon R. Jordan
William Steenbergen
Emma Brunskill
OffRL
351
14
0
16 Oct 2022
Off-policy evaluation for learning-to-rank via interpolating the
  item-position model and the position-based model
Off-policy evaluation for learning-to-rank via interpolating the item-position model and the position-based model
Alexander K. Buchholz
Ben London
Giuseppe Di Benedetto
Thorsten Joachims
OffRL
206
2
0
15 Oct 2022
Off-Policy Evaluation for Large Action Spaces via Embeddings
Off-Policy Evaluation for Large Action Spaces via EmbeddingsInternational Conference on Machine Learning (ICML), 2022
Yuta Saito
Thorsten Joachims
OffRL
292
60
0
13 Feb 2022
Model Selection in Batch Policy Optimization
Model Selection in Batch Policy OptimizationInternational Conference on Machine Learning (ICML), 2021
Jonathan Lee
George Tucker
Ofir Nachum
Bo Dai
OffRL
252
12
0
23 Dec 2021
Pessimistic Model Selection for Offline Deep Reinforcement Learning
Pessimistic Model Selection for Offline Deep Reinforcement LearningConference on Uncertainty in Artificial Intelligence (UAI), 2021
Chao-Han Huck Yang
Zhengling Qi
Yifan Cui
Pin-Yu Chen
OffRL
320
4
0
29 Nov 2021
Off-Policy Evaluation in Partially Observed Markov Decision Processes
  under Sequential Ignorability
Off-Policy Evaluation in Partially Observed Markov Decision Processes under Sequential IgnorabilityAnnals of Statistics (Ann. Stat.), 2021
Yupeng Tang
Seung-seob Lee
OffRL
410
30
0
24 Oct 2021
Evaluating the Robustness of Off-Policy Evaluation
Evaluating the Robustness of Off-Policy EvaluationACM Conference on Recommender Systems (RecSys), 2021
Yuta Saito
Takuma Udagawa
Haruka Kiyohara
Kazuki Mogi
Yusuke Narita
Kei Tateno
ELMOffRL
321
48
0
31 Aug 2021
Improving Long-Term Metrics in Recommendation Systems using
  Short-Horizon Reinforcement Learning
Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning
Bogdan Mazoure
Paul Mineiro
Pavithra Srinath
R. S. Sedeh
Doina Precup
Adith Swaminathan
OffRL
268
5
0
01 Jun 2021
Deeply-Debiased Off-Policy Interval Estimation
Deeply-Debiased Off-Policy Interval EstimationInternational Conference on Machine Learning (ICML), 2021
C. Shi
Runzhe Wan
Victor Chernozhukov
R. Song
OffRL
270
43
0
10 May 2021
Optimal Mixture Weights for Off-Policy Evaluation with Multiple Behavior
  Policies
Optimal Mixture Weights for Off-Policy Evaluation with Multiple Behavior Policies
Jinlin Lai
Lixin Zou
Jiaxing Song
OffRL
97
1
0
29 Nov 2020
Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment
  Settings
Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment SettingsNeural Information Processing Systems (NeurIPS), 2020
Hengrui Cai
C. Shi
R. Song
Wenbin Lu
OffRL
397
16
0
29 Oct 2020
Optimal Off-Policy Evaluation from Multiple Logging Policies
Optimal Off-Policy Evaluation from Multiple Logging Policies
Nathan Kallus
Yuta Saito
Masatoshi Uehara
OffRL
354
44
0
21 Oct 2020
1
Page 1 of 1