Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2202.06317
Cited By
Off-Policy Evaluation for Large Action Spaces via Embeddings
13 February 2022
Yuta Saito
Thorsten Joachims
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Off-Policy Evaluation for Large Action Spaces via Embeddings"
13 / 13 papers shown
Title
Doubly Robust Fusion of Many Treatments for Policy Learning
Ke Zhu
Jianing Chu
I. Lipkovich
Wenyu Ye
Shu Yang
34
0
0
12 May 2025
DOLCE: Decomposing Off-Policy Evaluation/Learning into Lagged and Current Effects
Shu Tamano
Masanori Nojima
OffRL
37
0
0
02 May 2025
Prompt Optimization with Logged Bandit Data
Haruka Kiyohara
Daniel Yiming Cao
Yuta Saito
Thorsten Joachims
64
0
0
03 Apr 2025
Cross-Validated Off-Policy Evaluation
Matej Cief
B. Kveton
Michal Kompan
OffRL
25
1
0
24 May 2024
Reduced-Rank Multi-objective Policy Learning and Optimization
Ezinne Nwankwo
Michael I. Jordan
Angela Zhou
OffRL
CML
22
0
0
29 Apr 2024
Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It
Yuta Saito
Masahiro Nomura
OffRL
47
2
0
23 Apr 2024
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Imad Aouali
Victor-Emmanuel Brunel
David Rohde
Anna Korba
OffRL
36
5
0
22 Feb 2024
Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction
Haruka Kiyohara
Masahiro Nomura
Yuta Saito
25
5
0
03 Feb 2024
Recent Advances in the Foundations and Applications of Unbiased Learning to Rank
Shashank Gupta
Philipp Hager
Jin Huang
Ali Vardasbi
Harrie Oosterhuis
OffRL
35
5
0
04 May 2023
Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare
Shengpu Tang
Maggie Makar
Michael Sjoding
Finale Doshi-Velez
Jenna Wiens
OffRL
55
39
0
02 May 2023
Off-Policy Evaluation in Embedded Spaces
Jaron J. R. Lee
David Arbour
Georgios Theocharous
OffRL
22
3
0
05 Mar 2022
Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions
James McInerney
B. Brost
Praveen Chandar
Rishabh Mehrotra
Ben Carterette
BDL
CML
OffRL
118
55
0
25 Jul 2020
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes
Nathan Kallus
Masatoshi Uehara
OffRL
38
181
0
22 Aug 2019
1