ResearchTrend.AI

Smooth Contextual Bandits: Bridging the Parametric and Non-differentiable Regret Regimes

5 September 2019
Yichun Hu
Nathan Kallus
Xiaojie Mao

Papers citing "Smooth Contextual Bandits: Bridging the Parametric and Non-differentiable Regret Regimes"

12 / 12 papers shown
Contextual Bandits for Unbounded Context Distributions
Puning Zhao
Xiaogang Xu
Zhe Liu
Huiwen Wu
Qin Zhang
Zong Ke
Tianhang Zheng
304
10
0
19 Aug 2024
Batched Nonparametric Contextual Bandits
Rong Jiang
Cong Ma
OffRL
117
1
0
27 Feb 2024
Kernel ε-Greedy for Multi-Armed Bandits with Covariates
Sakshi Arya
Bharath K. Sriperumbudur
139
0
0
29 Jun 2023
Optimal Contextual Bandits with Knapsacks under Realizability via Regression Oracles
Yuxuan Han
Jialin Zeng
Yang Wang
Yangzhen Xiang
Jiheng Zhang
103
9
0
21 Oct 2022
Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles
Aldo G. Carranza
Sanath Kumar Krishnamurthy
Susan Athey
50
1
0
30 Mar 2022
Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits
Yash J. Patel
Mohamad Kazem Shirani Faradonbeh
67
15
0
23 Oct 2021
Multi-armed Bandit Requiring Monotone Arm Sequences
Ningyuan Chen
133
11
0
07 Jun 2021
Risk Minimization from Adaptively Collected Data: Guarantees for Supervised and Policy Learning
Aurélien F. Bibaut
Antoine Chambaz
Maria Dimakopoulou
Nathan Kallus
Mark van der Laan
OffRL
91
15
0
03 Jun 2021
Instance-Dependent Bounds for Zeroth-order Lipschitz Optimization with Error Certificates
François Bachoc
Tommaso Cesari
Sébastien Gerchinovitz
73
10
0
03 Feb 2021
Fast Rates for the Regret of Offline Reinforcement Learning
Yichun Hu
Nathan Kallus
Masatoshi Uehara
OffRL
119
30
0
31 Jan 2021
Smooth Bandit Optimization: Generalization to Hölder Space
Yusha Liu
Yining Wang
Aarti Singh
60
10
0
11 Dec 2020
DTR Bandit: Learning to Make Response-Adaptive Decisions With Low Regret
Yichun Hu
Nathan Kallus
OffRL
17
0
0
06 May 2020