Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2111.04835
Cited By
Safe Data Collection for Offline and Online Policy Learning
8 November 2021
Ruihao Zhu
B. Kveton
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Safe Data Collection for Offline and Online Policy Learning"
5 / 5 papers shown
Title
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Subhojyoti Mukherjee
Josiah P. Hanna
Robert Nowak
OffRL
51
0
0
04 Jun 2024
Efficient and Interpretable Bandit Algorithms
Subhojyoti Mukherjee
Ruihao Zhu
B. Kveton
FAtt
23
2
0
23 Oct 2023
SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits
Subhojyoti Mukherjee
Qiaomin Xie
Josiah P. Hanna
R. Nowak
OffRL
53
5
0
29 Jan 2023
Risk-Aware Linear Bandits: Theory and Applications in Smart Order Routing
Jingwei Ji
Renyuan Xu
Ruihao Zhu
20
0
0
04 Aug 2022
Nonparametric Pricing Analytics with Customer Covariates
Ningyuan Chen
G. Gallego
59
39
0
03 May 2018
1