ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.12736
  4. Cited By
Balanced Off-Policy Evaluation for Personalized Pricing

Balanced Off-Policy Evaluation for Personalized Pricing

24 February 2023
Adam N. Elmachtoub
Vishal Gupta
Yunfan Zhao
    OffRL
ArXivPDFHTML

Papers citing "Balanced Off-Policy Evaluation for Personalized Pricing"

4 / 4 papers shown
Title
The Bandit Whisperer: Communication Learning for Restless Bandits
The Bandit Whisperer: Communication Learning for Restless Bandits
Yunfan Zhao
Tonghan Wang
Dheeraj M. Nagaraj
Aparna Taneja
Milind Tambe
49
5
0
11 Aug 2024
Confounding-Robust Policy Improvement with Human-AI Teams
Confounding-Robust Policy Improvement with Human-AI Teams
Ruijiang Gao
Mingzhang Yin
24
3
0
13 Oct 2023
Statistically Efficient Variance Reduction with Double Policy Estimation
  for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning
Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning
Hanhan Zhou
Tian-Shing Lan
Vaneet Aggarwal
OffRL
27
4
0
28 Aug 2023
Implicit Two-Tower Policies
Implicit Two-Tower Policies
Yunfan Zhao
Qingkai Pan
K. Choromanski
Deepali Jain
Vikas Sindhwani
OffRL
28
3
0
02 Aug 2022
1