arXiv:2301.12357
SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits
29 January 2023
Subhojyoti Mukherjee
Qiaomin Xie
Josiah P. Hanna
R. Nowak
OffRL
Papers citing
"SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits"
7 / 7 papers shown
Title
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Subhojyoti Mukherjee
Josiah P. Hanna
Robert Nowak
OffRL
22
0
0
04 Jun 2024
Optimal Design for Human Feedback
Subhojyoti Mukherjee
Anusha Lalitha
Kousha Kalantari
Aniket Deshmukh
Ge Liu
Yifei Ma
B. Kveton
23
0
0
22 Apr 2024
Multi-task Representation Learning for Pure Exploration in Bilinear Bandits
Subhojyoti Mukherjee
Qiaomin Xie
Josiah P. Hanna
Robert D. Nowak
29
5
0
01 Nov 2023
Experimental Designs for Heteroskedastic Variance
Justin Weltz
Tanner Fiez
Alex Volfovsky
Eric B. Laber
Blake Mason
Houssam Nassif
Lalit P. Jain
11
3
0
06 Oct 2023
Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPs
Dongruo Zhou
Quanquan Gu
73
43
0
23 May 2022
Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP
Zihan Zhang
Jiaqi Yang
Xiangyang Ji
S. Du
54
36
0
29 Jan 2021
Minimax Number of Strata for Online Stratified Sampling given Noisy Samples
Alexandra Carpentier
Rémi Munos
44
13
0
18 May 2012