ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2408.11202
  4. Cited By
Effective Off-Policy Evaluation and Learning in Contextual Combinatorial
  Bandits

Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits

20 August 2024
Tatsuhiro Shimizu
Koichi Tanaka
Ren Kishimoto
Haruka Kiyohara
Masahiro Nomura
Yuta Saito
    CML
    OffRL
ArXivPDFHTML

Papers citing "Effective Off-Policy Evaluation and Learning in Contextual Combinatorial Bandits"

2 / 2 papers shown
Title
Counterfactual Evaluation of Slate Recommendations with Sequential
  Reward Interactions
Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions
James McInerney
B. Brost
Praveen Chandar
Rishabh Mehrotra
Ben Carterette
BDL
CML
OffRL
107
55
0
25 Jul 2020
Matroid Bandits: Fast Combinatorial Optimization with Learning
Matroid Bandits: Fast Combinatorial Optimization with Learning
B. Kveton
Zheng Wen
Azin Ashkan
Hoda Eydgahi
Brian Eriksson
41
119
0
20 Mar 2014
1