Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2408.15535
Cited By
Improving Thompson Sampling via Information Relaxation for Budgeted Multi-armed Bandits
28 August 2024
Woojin Jeong
Seungki Min
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Improving Thompson Sampling via Information Relaxation for Budgeted Multi-armed Bandits"
2 / 2 papers shown
Title
Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games
Shicong Cen
Yuejie Chi
S. Du
Lin Xiao
33
26
0
03 Oct 2022
Large Scale Interactive Motion Forecasting for Autonomous Driving : The Waymo Open Motion Dataset
Scott Ettinger
Shuyang Cheng
Benjamin Caine
Chenxi Liu
Hang Zhao
...
Jiquan Ngiam
Vijay Vasudevan
Alexander McCauley
Jonathon Shlens
Drago Anguelov
123
421
0
20 Apr 2021
1