ResearchTrend.AI
Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration Matters

3 July 2018
Aniruddh Raghu
Omer Gottesman
Yao Liu
Matthieu Komorowski
A. Faisal
Finale Doshi-Velez
Emma Brunskill
    OffRL

Papers citing "Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration Matters"

6 / 6 papers shown
Offline Policy Optimization with Eligible Actions
Yao Liu
Yannis Flet-Berliac
Emma Brunskill
OffRL
25
5
0
01 Jul 2022
Benchmarks for Deep Off-Policy Evaluation
Justin Fu
Mohammad Norouzi
Ofir Nachum
George Tucker
Ziyun Wang
...
Yutian Chen
Aviral Kumar
Cosmin Paduraru
Sergey Levine
T. Paine
ELM
OffRL
35
100
0
30 Mar 2021
Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning
Ming Yin
Yu Bai
Yu-Xiang Wang
OffRL
30
31
0
07 Jul 2020
Counterfactual Data Augmentation using Locally Factored Dynamics
Silviu Pitis
Elliot Creager
Animesh Garg
BDL
OffRL
21
85
0
06 Jul 2020
POPCORN: Partially Observed Prediction COnstrained ReiNforcement Learning
Joseph D. Futoma
M. C. Hughes
Finale Doshi-Velez
OffRL
21
49
0
13 Jan 2020
Identifying Distinct, Effective Treatments for Acute Hypotension with SODA-RL: Safely Optimized Diverse Accurate Reinforcement Learning
Joseph D. Futoma
M. A. Masood
Finale Doshi-Velez
OffRL
OOD
16
11
0
09 Jan 2020