ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1904.10799
  4. Cited By
Three Methods for Training on Bandit Feedback
v1v2 (latest)

Three Methods for Training on Bandit Feedback

24 April 2019
Dmytro Mykhaylov
D. Rohde
Flavian Vasile
Martin Bompaire
Olivier Jeunen
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Three Methods for Training on Bandit Feedback"

1 / 1 papers shown
Title
Offline Evaluation of Reward-Optimizing Recommender Systems: The Case of
  Simulation
Offline Evaluation of Reward-Optimizing Recommender Systems: The Case of Simulation
Imad Aouali
Amine Benhalloum
Martin Bompaire
Benjamin Heymann
Olivier Jeunen
D. Rohde
Otmane Sakhi
Flavian Vasile
OffRL
56
2
0
18 Sep 2022
1