ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2408.15535
  4. Cited By
Improving Thompson Sampling via Information Relaxation for Budgeted
  Multi-armed Bandits

Improving Thompson Sampling via Information Relaxation for Budgeted Multi-armed Bandits

28 August 2024
Woojin Jeong
Seungki Min
ArXivPDFHTML

Papers citing "Improving Thompson Sampling via Information Relaxation for Budgeted Multi-armed Bandits"

2 / 2 papers shown
Title
Faster Last-iterate Convergence of Policy Optimization in Zero-Sum
  Markov Games
Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games
Shicong Cen
Yuejie Chi
S. Du
Lin Xiao
33
26
0
03 Oct 2022
Large Scale Interactive Motion Forecasting for Autonomous Driving : The
  Waymo Open Motion Dataset
Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset
Scott Ettinger
Shuyang Cheng
Benjamin Caine
Chenxi Liu
Hang Zhao
...
Jiquan Ngiam
Vijay Vasudevan
Alexander McCauley
Jonathon Shlens
Drago Anguelov
123
421
0
20 Apr 2021
1