Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2302.12736
Cited By
Balanced Off-Policy Evaluation for Personalized Pricing
24 February 2023
Adam N. Elmachtoub
Vishal Gupta
Yunfan Zhao
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Balanced Off-Policy Evaluation for Personalized Pricing"
4 / 4 papers shown
Title
The Bandit Whisperer: Communication Learning for Restless Bandits
Yunfan Zhao
Tonghan Wang
Dheeraj M. Nagaraj
Aparna Taneja
Milind Tambe
49
5
0
11 Aug 2024
Confounding-Robust Policy Improvement with Human-AI Teams
Ruijiang Gao
Mingzhang Yin
24
3
0
13 Oct 2023
Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning
Hanhan Zhou
Tian-Shing Lan
Vaneet Aggarwal
OffRL
27
4
0
28 Aug 2023
Implicit Two-Tower Policies
Yunfan Zhao
Qingkai Pan
K. Choromanski
Deepali Jain
Vikas Sindhwani
OffRL
28
3
0
02 Aug 2022
1