ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2405.02594
  4. Cited By
Leveraging (Biased) Information: Multi-armed Bandits with Offline Data

Leveraging (Biased) Information: Multi-armed Bandits with Offline Data

4 May 2024
Wang Chi Cheung
Lixing Lyu
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Leveraging (Biased) Information: Multi-armed Bandits with Offline Data"

6 / 6 papers shown
Title
Augmenting Online RL with Offline Data is All You Need: A Unified Hybrid RL Algorithm Design and Analysis
Augmenting Online RL with Offline Data is All You Need: A Unified Hybrid RL Algorithm Design and Analysis
Ruiquan Huang
Donghao Li
Chengshuai Shi
Cong Shen
Jing Yang
OffRL
97
0
0
01 Jul 2025
Multi-Armed Bandits With Machine Learning-Generated Surrogate Rewards
Multi-Armed Bandits With Machine Learning-Generated Surrogate Rewards
Wenlong Ji
Yihan Pan
Ruihao Zhu
Lihua Lei
7
0
0
20 Jun 2025
Best Arm Identification with Possibly Biased Offline Data
Best Arm Identification with Possibly Biased Offline Data
Le Yang
Vincent Y. F. Tan
Wang Chi Cheung
25
0
0
29 May 2025
Learning to Price with Resource Constraints: From Full Information to Machine-Learned Prices
Learning to Price with Resource Constraints: From Full Information to Machine-Learned Prices
Ruicheng Ao
Jiashuo Jiang
D. Simchi-Levi
114
2
0
24 Jan 2025
Beyond IID: data-driven decision-making in heterogeneous environments
Beyond IID: data-driven decision-making in heterogeneous environments
Omar Besbes
Will Ma
Omar Mouchtaki
102
8
0
03 Jan 2025
Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits
Artificial Replay: A Meta-Algorithm for Harnessing Historical Data in Bandits
Siddhartha Banerjee
Sean R. Sinclair
Milind Tambe
Lily Xu
Chao Yu
AI4TS
187
8
0
30 Sep 2022
1