Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations

IEEE Conference on Decision and Control (CDC), 2022
10 April 2022
Hongju Park
Mohamad Kazem Shirani Faradonbeh
    OffRL

Papers citing "Worst-case Performance of Greedy Policies in Bandits with Imperfect Context Observations"

2 / 2 papers shown
Thompson Sampling in Partially Observable Contextual Bandits
Hongju Park
Mohamad Kazem Shirani Faradonbeh
15 Feb 2024
Online learning in bandits with predicted context
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Yongyi Guo
Ziping Xu
Susan Murphy
26 Jul 2023