2112.12320
Cited By
Model Selection in Batch Policy Optimization
23 December 2021
Jonathan Lee
George Tucker
Ofir Nachum
Bo Dai
OffRL
Papers citing
"Model Selection in Batch Policy Optimization"
7 / 7 papers shown
Title
Model Selection for Off-policy Evaluation: New Algorithms and Experimental Protocol
Pai Liu
Lingfeng Zhao
Shivangi Agarwal
Jinghan Liu
Audrey Huang
P. Amortila
Nan Jiang
OODD
OffRL
96
0
0
11 Feb 2025
Cross-Validated Off-Policy Evaluation
Matej Cief
B. Kveton
Michal Kompan
OffRL
20
1
0
24 May 2024
Estimating Optimal Policy Value in General Linear Contextual Bandits
Jonathan Lee
Weihao Kong
Aldo Pacchiano
Vidya Muthukumar
Emma Brunskill
13
0
0
19 Feb 2023
A Workflow for Offline Model-Free Robotic Reinforcement Learning
Aviral Kumar
Anika Singh
Stephen Tian
Chelsea Finn
Sergey Levine
OffRL
138
84
0
22 Sep 2021
Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints
Chi Jin
Zhuoran Yang
Zhaoran Wang
OffRL
107
166
0
06 Jan 2021
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
329
1,949
0
04 May 2020
Bounded regret in stochastic multi-armed bandits
Sébastien Bubeck
Vianney Perchet
Philippe Rigollet
56
90
0
06 Feb 2013