Off-Policy Evaluation for Large Action Spaces via Embeddings


13 February 2022
Yuta Saito
Thorsten Joachims
    OffRL

Papers citing "Off-Policy Evaluation for Large Action Spaces via Embeddings"

13 / 13 papers shown
Doubly Robust Fusion of Many Treatments for Policy Learning
Ke Zhu
Jianing Chu
I. Lipkovich
Wenyu Ye
Shu Yang
34
0
0
12 May 2025
DOLCE: Decomposing Off-Policy Evaluation/Learning into Lagged and Current Effects
Shu Tamano
Masanori Nojima
OffRL
37
0
0
02 May 2025
Prompt Optimization with Logged Bandit Data
Haruka Kiyohara
Daniel Yiming Cao
Yuta Saito
Thorsten Joachims
64
0
0
03 Apr 2025
Cross-Validated Off-Policy Evaluation
Matej Cief
B. Kveton
Michal Kompan
OffRL
25
1
0
24 May 2024
Reduced-Rank Multi-objective Policy Learning and Optimization
Ezinne Nwankwo
Michael I. Jordan
Angela Zhou
OffRL
CML
22
0
0
29 Apr 2024
Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It
Yuta Saito
Masahiro Nomura
OffRL
47
2
0
23 Apr 2024
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Imad Aouali
Victor-Emmanuel Brunel
David Rohde
Anna Korba
OffRL
36
5
0
22 Feb 2024
Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction
Haruka Kiyohara
Masahiro Nomura
Yuta Saito
25
5
0
03 Feb 2024
Recent Advances in the Foundations and Applications of Unbiased Learning to Rank
Shashank Gupta
Philipp Hager
Jin Huang
Ali Vardasbi
Harrie Oosterhuis
OffRL
35
5
0
04 May 2023
Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare
Shengpu Tang
Maggie Makar
Michael Sjoding
Finale Doshi-Velez
Jenna Wiens
OffRL
55
39
0
02 May 2023
Off-Policy Evaluation in Embedded Spaces
Jaron J. R. Lee
David Arbour
Georgios Theocharous
OffRL
22
3
0
05 Mar 2022
Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions
James McInerney
B. Brost
Praveen Chandar
Rishabh Mehrotra
Ben Carterette
BDL
CML
OffRL
118
55
0
25 Jul 2020
Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes
Nathan Kallus
Masatoshi Uehara
OffRL
38
181
0
22 Aug 2019