2405.17324
Cited By
Leveraging Offline Data in Linear Latent Bandits
27 May 2024
Chinmaya Kausik, Kevin Tan, Ambuj Tewari
OffRL
Papers citing "Leveraging Offline Data in Linear Latent Bandits"
3 / 3 papers shown
Hybrid Reinforcement Learning Breaks Sample Size Barriers in Linear MDPs
Kevin Tan, Wei Fan, Yuting Wei
OffRL
77, 2, 0
08 Aug 2024

Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
Mitsuhiko Nakamoto, Yuexiang Zhai, Anika Singh, Max Sobol Mark, Yi Ma, Chelsea Finn, Aviral Kumar, Sergey Levine
OffRL, OnRL
112, 108, 0
09 Mar 2023

Estimating means of bounded random variables by betting
Ian Waudby-Smith, Aaditya Ramdas
59, 148, 0
19 Oct 2020