Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.04204
Cited By
v1
v2 (latest)
Semiparametric Contextual Bandits
12 March 2018
A. Krishnamurthy
Zhiwei Steven Wu
Vasilis Syrgkanis
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Semiparametric Contextual Bandits"
22 / 22 papers shown
Title
Experimental Design for Semiparametric Bandits
Seok-Jin Kim
Gi-Soo Kim
Min-hwan Oh
21
0
0
16 Jun 2025
Zero-Inflated Bandits
Haoyu Wei
Runzhe Wan
Lei Shi
Rui Song
102
0
0
25 Dec 2023
Selective Uncertainty Propagation in Offline RL
Sanath Kumar Krishnamurthy
Shrey Modi
Tanmay Gangwani
S. Katariya
Branislav Kveton
A. Rangi
OffRL
216
0
0
01 Feb 2023
GBOSE: Generalized Bandit Orthogonalized Semiparametric Estimation
Mubarrat Chowdhury
Elkhan Ismayilzada
Khalequzzaman Sayem
Gi-Soo Kim
51
1
0
20 Jan 2023
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach
Miao Lu
Wenhao Yang
Liangyu Zhang
Zhihua Zhang
OffRL
73
1
0
12 Sep 2022
Semi-Parametric Contextual Bandits with Graph-Laplacian Regularization
Y. Choi
Gi-Soo Kim
Seung-Jin Paik
M. Paik
62
6
0
17 May 2022
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles
Aldo G. Carranza
Sanath Kumar Krishnamurthy
Susan Athey
48
1
0
30 Mar 2022
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions
Nina Deliu
Joseph Jay Williams
B. Chakraborty
OffRL
65
5
0
04 Mar 2022
A Robust Phased Elimination Algorithm for Corruption-Tolerant Gaussian Process Bandits
Ilija Bogunovic
Zihan Li
Andreas Krause
Jonathan Scarlett
79
9
0
03 Feb 2022
Model Selection for Generic Contextual Bandits
Avishek Ghosh
Abishek Sankararaman
Kannan Ramchandran
76
6
0
07 Jul 2021
Bias-Robust Bayesian Optimization via Dueling Bandits
Johannes Kirschner
Andreas Krause
50
11
0
25 May 2021
Problem-Complexity Adaptive Model Selection for Stochastic Linear Bandits
Avishek Ghosh
Abishek Sankararaman
Kannan Ramchandran
70
34
0
04 Jun 2020
Bandits with adversarial scaling
Thodoris Lykouris
Vahab Mirrokni
R. Leme
70
14
0
04 Mar 2020
Learning Near Optimal Policies with Low Inherent Bellman Error
Andrea Zanette
A. Lazaric
Mykel Kochenderfer
Emma Brunskill
OffRL
91
222
0
29 Feb 2020
A Finite-Sample Deviation Bound for Stable Autoregressive Processes
Rodrigo A. González
C. Rojas
42
5
0
17 Dec 2019
The Nonstochastic Control Problem
Elad Hazan
Sham Kakade
Karan Singh
72
120
0
27 Nov 2019
Corruption-robust exploration in episodic reinforcement learning
Thodoris Lykouris
Max Simchowitz
Aleksandrs Slivkins
Wen Sun
103
105
0
20 Nov 2019
Personalized HeartSteps: A Reinforcement Learning Algorithm for Optimizing Physical Activity
Peng Liao
Kristjan Greenewald
P. Klasnja
Susan Murphy
67
85
0
08 Sep 2019
Model selection for contextual bandits
Dylan J. Foster
A. Krishnamurthy
Haipeng Luo
OffRL
216
90
0
03 Jun 2019
Learning Linear Dynamical Systems with Semi-Parametric Least Squares
Max Simchowitz
Ross Boczar
Benjamin Recht
73
116
0
02 Feb 2019
Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model
Gi-Soo Kim
M. Paik
126
15
0
31 Jan 2019
Semi-parametric dynamic contextual pricing
Virag Shah
Jose H. Blanchet
Ramesh Johari
81
36
0
07 Jan 2019
1