
Semiparametric Contextual Bandits

12 March 2018

Papers citing "Semiparametric Contextual Bandits"

22 / 22 papers shown

Title | Authors | Tag | Counts | Date
Experimental Design for Semiparametric Bandits | Seok-Jin Kim, Gi-Soo Kim, Min-hwan Oh | - | 21 0 0 | 16 Jun 2025
Zero-Inflated Bandits | Haoyu Wei, Runzhe Wan, Lei Shi, Rui Song | - | 104 0 0 | 25 Dec 2023
Selective Uncertainty Propagation in Offline RL | Sanath Kumar Krishnamurthy, Shrey Modi, Tanmay Gangwani, S. Katariya, Branislav Kveton, A. Rangi | OffRL | 216 0 0 | 01 Feb 2023
GBOSE: Generalized Bandit Orthogonalized Semiparametric Estimation | Mubarrat Chowdhury, Elkhan Ismayilzada, Khalequzzaman Sayem, Gi-Soo Kim | - | 51 1 0 | 20 Jan 2023
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach | Miao Lu, Wenhao Yang, Liangyu Zhang, Zhihua Zhang | OffRL | 73 1 0 | 12 Sep 2022
Semi-Parametric Contextual Bandits with Graph-Laplacian Regularization | Y. Choi, Gi-Soo Kim, Seung-Jin Paik, M. Paik | - | 62 6 0 | 17 May 2022
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles | Aldo G. Carranza, Sanath Kumar Krishnamurthy, Susan Athey | - | 48 1 0 | 30 Mar 2022
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions | Nina Deliu, Joseph Jay Williams, B. Chakraborty | OffRL | 65 5 0 | 04 Mar 2022
A Robust Phased Elimination Algorithm for Corruption-Tolerant Gaussian Process Bandits | Ilija Bogunovic, Zihan Li, Andreas Krause, Jonathan Scarlett | - | 79 9 0 | 03 Feb 2022
Model Selection for Generic Contextual Bandits | Avishek Ghosh, Abishek Sankararaman, Kannan Ramchandran | - | 76 6 0 | 07 Jul 2021
Bias-Robust Bayesian Optimization via Dueling Bandits | Johannes Kirschner, Andreas Krause | - | 52 11 0 | 25 May 2021
Problem-Complexity Adaptive Model Selection for Stochastic Linear Bandits | Avishek Ghosh, Abishek Sankararaman, Kannan Ramchandran | - | 70 34 0 | 04 Jun 2020
Bandits with adversarial scaling | Thodoris Lykouris, Vahab Mirrokni, R. Leme | - | 70 14 0 | 04 Mar 2020
Learning Near Optimal Policies with Low Inherent Bellman Error | Andrea Zanette, A. Lazaric, Mykel Kochenderfer, Emma Brunskill | OffRL | 91 222 0 | 29 Feb 2020
A Finite-Sample Deviation Bound for Stable Autoregressive Processes | Rodrigo A. González, C. Rojas | - | 42 5 0 | 17 Dec 2019
The Nonstochastic Control Problem | Elad Hazan, Sham Kakade, Karan Singh | - | 72 120 0 | 27 Nov 2019
Corruption-robust exploration in episodic reinforcement learning | Thodoris Lykouris, Max Simchowitz, Aleksandrs Slivkins, Wen Sun | - | 105 105 0 | 20 Nov 2019
Personalized HeartSteps: A Reinforcement Learning Algorithm for Optimizing Physical Activity | Peng Liao, Kristjan Greenewald, P. Klasnja, Susan Murphy | - | 67 85 0 | 08 Sep 2019
Model selection for contextual bandits | Dylan J. Foster, A. Krishnamurthy, Haipeng Luo | OffRL | 216 90 0 | 03 Jun 2019
Learning Linear Dynamical Systems with Semi-Parametric Least Squares | Max Simchowitz, Ross Boczar, Benjamin Recht | - | 73 116 0 | 02 Feb 2019
Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model | Gi-Soo Kim, M. Paik | - | 128 15 0 | 31 Jan 2019
Semi-parametric dynamic contextual pricing | Virag Shah, Jose H. Blanchet, Ramesh Johari | - | 81 36 0 | 07 Jan 2019