ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.04819
44
5

Contextual Recommendations and Low-Regret Cutting-Plane Algorithms

9 June 2021
Sreenivas Gollapudi
Guru Guruganesh
Kostas Kollias
Pasin Manurangsi
R. Leme
Jon Schneider
ArXiv (abs)PDFHTML
Abstract

We consider the following variant of contextual linear bandits motivated by routing applications in navigational engines and recommendation systems. We wish to learn a hidden ddd-dimensional value w∗w^*w∗. Every round, we are presented with a subset Xt⊆Rd\mathcal{X}_t \subseteq \mathbb{R}^dXt​⊆Rd of possible actions. If we choose (i.e. recommend to the user) action xtx_txt​, we obtain utility ⟨xt,w∗⟩\langle x_t, w^* \rangle⟨xt​,w∗⟩ but only learn the identity of the best action arg⁡max⁡x∈Xt⟨x,w∗⟩\arg\max_{x \in \mathcal{X}_t} \langle x, w^* \rangleargmaxx∈Xt​​⟨x,w∗⟩. We design algorithms for this problem which achieve regret O(dlog⁡T)O(d\log T)O(dlogT) and exp⁡(O(dlog⁡d))\exp(O(d \log d))exp(O(dlogd)). To accomplish this, we design novel cutting-plane algorithms with low "regret" -- the total distance between the true point w∗w^*w∗ and the hyperplanes the separation oracle returns. We also consider the variant where we are allowed to provide a list of several recommendations. In this variant, we give an algorithm with O(d2log⁡d)O(d^2 \log d)O(d2logd) regret and list size poly(d)\mathrm{poly}(d)poly(d). Finally, we construct nearly tight algorithms for a weaker variant of this problem where the learner only learns the identity of an action that is better than the recommendation. Our results rely on new algorithmic techniques in convex geometry (including a variant of Steiner's formula for the centroid of a convex set) which may be of independent interest.

View on arXiv
Comments on this paper