ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.04204
  4. Cited By
Semiparametric Contextual Bandits
v1v2 (latest)

Semiparametric Contextual Bandits

12 March 2018
A. Krishnamurthy
Zhiwei Steven Wu
Vasilis Syrgkanis
ArXiv (abs)PDFHTML

Papers citing "Semiparametric Contextual Bandits"

22 / 22 papers shown
Title
Experimental Design for Semiparametric Bandits
Experimental Design for Semiparametric Bandits
Seok-Jin Kim
Gi-Soo Kim
Min-hwan Oh
21
0
0
16 Jun 2025
Zero-Inflated Bandits
Zero-Inflated Bandits
Haoyu Wei
Runzhe Wan
Lei Shi
Rui Song
104
0
0
25 Dec 2023
Selective Uncertainty Propagation in Offline RL
Selective Uncertainty Propagation in Offline RL
Sanath Kumar Krishnamurthy
Shrey Modi
Tanmay Gangwani
S. Katariya
Branislav Kveton
A. Rangi
OffRL
216
0
0
01 Feb 2023
GBOSE: Generalized Bandit Orthogonalized Semiparametric Estimation
GBOSE: Generalized Bandit Orthogonalized Semiparametric Estimation
Mubarrat Chowdhury
Elkhan Ismayilzada
Khalequzzaman Sayem
Gi-Soo Kim
51
1
0
20 Jan 2023
Statistical Estimation of Confounded Linear MDPs: An Instrumental
  Variable Approach
Statistical Estimation of Confounded Linear MDPs: An Instrumental Variable Approach
Miao Lu
Wenhao Yang
Liangyu Zhang
Zhihua Zhang
OffRL
73
1
0
12 Sep 2022
Semi-Parametric Contextual Bandits with Graph-Laplacian Regularization
Semi-Parametric Contextual Bandits with Graph-Laplacian Regularization
Y. Choi
Gi-Soo Kim
Seung-Jin Paik
M. Paik
62
6
0
17 May 2022
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment
  Effect Oracles
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles
Aldo G. Carranza
Sanath Kumar Krishnamurthy
Susan Athey
48
1
0
30 Mar 2022
Reinforcement Learning in Modern Biostatistics: Constructing Optimal
  Adaptive Interventions
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions
Nina Deliu
Joseph Jay Williams
B. Chakraborty
OffRL
65
5
0
04 Mar 2022
A Robust Phased Elimination Algorithm for Corruption-Tolerant Gaussian
  Process Bandits
A Robust Phased Elimination Algorithm for Corruption-Tolerant Gaussian Process Bandits
Ilija Bogunovic
Zihan Li
Andreas Krause
Jonathan Scarlett
79
9
0
03 Feb 2022
Model Selection for Generic Contextual Bandits
Model Selection for Generic Contextual Bandits
Avishek Ghosh
Abishek Sankararaman
Kannan Ramchandran
76
6
0
07 Jul 2021
Bias-Robust Bayesian Optimization via Dueling Bandits
Bias-Robust Bayesian Optimization via Dueling Bandits
Johannes Kirschner
Andreas Krause
52
11
0
25 May 2021
Problem-Complexity Adaptive Model Selection for Stochastic Linear
  Bandits
Problem-Complexity Adaptive Model Selection for Stochastic Linear Bandits
Avishek Ghosh
Abishek Sankararaman
Kannan Ramchandran
70
34
0
04 Jun 2020
Bandits with adversarial scaling
Bandits with adversarial scaling
Thodoris Lykouris
Vahab Mirrokni
R. Leme
70
14
0
04 Mar 2020
Learning Near Optimal Policies with Low Inherent Bellman Error
Learning Near Optimal Policies with Low Inherent Bellman Error
Andrea Zanette
A. Lazaric
Mykel Kochenderfer
Emma Brunskill
OffRL
91
222
0
29 Feb 2020
A Finite-Sample Deviation Bound for Stable Autoregressive Processes
A Finite-Sample Deviation Bound for Stable Autoregressive Processes
Rodrigo A. González
C. Rojas
42
5
0
17 Dec 2019
The Nonstochastic Control Problem
The Nonstochastic Control Problem
Elad Hazan
Sham Kakade
Karan Singh
72
120
0
27 Nov 2019
Corruption-robust exploration in episodic reinforcement learning
Corruption-robust exploration in episodic reinforcement learning
Thodoris Lykouris
Max Simchowitz
Aleksandrs Slivkins
Wen Sun
105
105
0
20 Nov 2019
Personalized HeartSteps: A Reinforcement Learning Algorithm for
  Optimizing Physical Activity
Personalized HeartSteps: A Reinforcement Learning Algorithm for Optimizing Physical Activity
Peng Liao
Kristjan Greenewald
P. Klasnja
Susan Murphy
67
85
0
08 Sep 2019
Model selection for contextual bandits
Model selection for contextual bandits
Dylan J. Foster
A. Krishnamurthy
Haipeng Luo
OffRL
216
90
0
03 Jun 2019
Learning Linear Dynamical Systems with Semi-Parametric Least Squares
Learning Linear Dynamical Systems with Semi-Parametric Least Squares
Max Simchowitz
Ross Boczar
Benjamin Recht
73
116
0
02 Feb 2019
Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model
Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model
Gi-Soo Kim
M. Paik
128
15
0
31 Jan 2019
Semi-parametric dynamic contextual pricing
Semi-parametric dynamic contextual pricing
Virag Shah
Jose H. Blanchet
Ramesh Johari
81
36
0
07 Jan 2019
1