ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2111.04835
  4. Cited By
Safe Data Collection for Offline and Online Policy Learning

Safe Data Collection for Offline and Online Policy Learning

8 November 2021
Ruihao Zhu
B. Kveton
    OffRL
ArXivPDFHTML

Papers citing "Safe Data Collection for Offline and Online Policy Learning"

5 / 5 papers shown
Title
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in
  Tabular MDP
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
Subhojyoti Mukherjee
Josiah P. Hanna
Robert Nowak
OffRL
51
0
0
04 Jun 2024
Efficient and Interpretable Bandit Algorithms
Efficient and Interpretable Bandit Algorithms
Subhojyoti Mukherjee
Ruihao Zhu
B. Kveton
FAtt
23
2
0
23 Oct 2023
SPEED: Experimental Design for Policy Evaluation in Linear
  Heteroscedastic Bandits
SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits
Subhojyoti Mukherjee
Qiaomin Xie
Josiah P. Hanna
R. Nowak
OffRL
53
5
0
29 Jan 2023
Risk-Aware Linear Bandits: Theory and Applications in Smart Order
  Routing
Risk-Aware Linear Bandits: Theory and Applications in Smart Order Routing
Jingwei Ji
Renyuan Xu
Ruihao Zhu
20
0
0
04 Aug 2022
Nonparametric Pricing Analytics with Customer Covariates
Nonparametric Pricing Analytics with Customer Covariates
Ningyuan Chen
G. Gallego
59
39
0
03 May 2018
1