ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2306.17329
135
0
v1v2 (latest)

Kernel εεε-Greedy for Contextual Bandits

29 June 2023
Sakshi Arya
Bharath K. Sriperumbudur
ArXiv (abs)PDFHTML
Main:35 Pages
3 Figures
Bibliography:5 Pages
Abstract

We consider a kernelized version of the ϵ\epsilonϵ-greedy strategy for contextual bandits. More precisely, in a setting with finitely many arms, we consider that the mean reward functions lie in a reproducing kernel Hilbert space (RKHS). We propose an online weighted kernel ridge regression estimator for the reward functions. Under some conditions on the exploration probability sequence, {ϵt}t\{\epsilon_t\}_t{ϵt​}t​, and choice of the regularization parameter, {λt}t\{\lambda_t\}_t{λt​}t​, we show that the proposed estimator is consistent. We also show that for any choice of kernel and the corresponding RKHS, we achieve a sub-linear regret rate depending on the intrinsic dimensionality of the RKHS. Furthermore, we achieve the optimal regret rate of T\sqrt{T}T​ under a margin condition for finite-dimensional RKHS.

View on arXiv
@article{arya2025_2306.17329,
  title={ Kernel $ε$-Greedy for Multi-Armed Bandits with Covariates },
  author={ Sakshi Arya and Bharath K. Sriperumbudur },
  journal={arXiv preprint arXiv:2306.17329},
  year={ 2025 }
}
Comments on this paper