Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits

20 August 2024

Yuta Saito

Papers citing "Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits"

2 / 2 papers shown

Title
Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions James McInerney B. Brost Praveen Chandar Rishabh Mehrotra Ben Carterette BDL CML OffRL 107 55 0 25 Jul 2020
Matroid Bandits: Fast Combinatorial Optimization with Learning B. Kveton Zheng Wen Azin Ashkan Hoda Eydgahi Brian Eriksson 41 119 0 20 Mar 2014