Smooth Contextual Bandits: Bridging the Parametric and Non-differentiable Regret Regimes



5 September 2019


Papers citing "Smooth Contextual Bandits: Bridging the Parametric and Non-differentiable Regret Regimes"

12 / 12 papers shown

Title | Authors | Tag | Counts | Date
Contextual Bandits for Unbounded Context Distributions | Puning Zhao, Xiaogang Xu, Zhe Liu, Huiwen Wu, Qin Zhang, Zong Ke, Tianhang Zheng | - | 304 10 0 | 19 Aug 2024
Batched Nonparametric Contextual Bandits | Rong Jiang, Cong Ma | OffRL | 117 1 0 | 27 Feb 2024
Kernel ε-Greedy for Multi-Armed Bandits with Covariates | Sakshi Arya, Bharath K. Sriperumbudur | - | 139 0 0 | 29 Jun 2023
Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles | Yuxuan Han, Jialin Zeng, Yang Wang, Yangzhen Xiang, Jiheng Zhang | - | 103 9 0 | 21 Oct 2022
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles | Aldo G. Carranza, Sanath Kumar Krishnamurthy, Susan Athey | - | 50 1 0 | 30 Mar 2022
Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits | Yash J. Patel, Mohamad Kazem Shirani Faradonbeh | - | 67 15 0 | 23 Oct 2021
Multi-armed Bandit Requiring Monotone Arm Sequences | Ningyuan Chen | - | 133 11 0 | 07 Jun 2021
Risk Minimization from Adaptively Collected Data: Guarantees for Supervised and Policy Learning | Aurélien F. Bibaut, Antoine Chambaz, Maria Dimakopoulou, Nathan Kallus, Mark van der Laan | OffRL | 91 15 0 | 03 Jun 2021
Instance-Dependent Bounds for Zeroth-order Lipschitz Optimization with Error Certificates | François Bachoc, Tommaso Cesari, Sébastien Gerchinovitz | - | 73 10 0 | 03 Feb 2021
Fast Rates for the Regret of Offline Reinforcement Learning | Yichun Hu, Nathan Kallus, Masatoshi Uehara | OffRL | 119 30 0 | 31 Jan 2021
Smooth Bandit Optimization: Generalization to Hölder Space | Yusha Liu, Yining Wang, Aarti Singh | - | 60 10 0 | 11 Dec 2020
DTR Bandit: Learning to Make Response-Adaptive Decisions With Low Regret | Yichun Hu, Nathan Kallus | OffRL | 17 0 0 | 06 May 2020