2405.02594
Leveraging (Biased) Information: Multi-armed Bandits with Offline Data
4 May 2024
Wang Chi Cheung
Lixing Lyu
OffRL
Papers citing "Leveraging (Biased) Information: Multi-armed Bandits with Offline Data"
6 / 6 papers shown
Augmenting Online RL with Offline Data is All You Need: A Unified Hybrid RL Algorithm Design and Analysis
Ruiquan Huang
Donghao Li
Chengshuai Shi
Cong Shen
Jing Yang
OffRL
97
0
0
01 Jul 2025
Multi-Armed Bandits With Machine Learning-Generated Surrogate Rewards
Wenlong Ji
Yihan Pan
Ruihao Zhu
Lihua Lei
7
0
0
20 Jun 2025
Best Arm Identification with Possibly Biased Offline Data
Le Yang
Vincent Y. F. Tan
Wang Chi Cheung
25
0
0
29 May 2025
Learning to Price with Resource Constraints: From Full Information to Machine-Learned Prices
Ruicheng Ao
Jiashuo Jiang
D. Simchi-Levi
114
2
0
24 Jan 2025
Beyond IID: data-driven decision-making in heterogeneous environments
Omar Besbes
Will Ma
Omar Mouchtaki
102
8
0
03 Jan 2025
Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits
Siddhartha Banerjee
Sean R. Sinclair
Milind Tambe
Lily Xu
Chao Yu
AI4TS
187
8
0
30 Sep 2022