ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.08471
  4. Cited By
Learning from Bandit Feedback: An Overview of the State-of-the-art

Learning from Bandit Feedback: An Overview of the State-of-the-art

18 September 2019
Olivier Jeunen
Dmytro Mykhaylov
D. Rohde
Flavian Vasile
Alexandre Gilotte
Martin Bompaire
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Learning from Bandit Feedback: An Overview of the State-of-the-art"

3 / 3 papers shown
Title
Offline Evaluation of Reward-Optimizing Recommender Systems: The Case of
  Simulation
Offline Evaluation of Reward-Optimizing Recommender Systems: The Case of Simulation
Imad Aouali
Amine Benhalloum
Martin Bompaire
Benjamin Heymann
Olivier Jeunen
D. Rohde
Otmane Sakhi
Flavian Vasile
OffRL
56
2
0
18 Sep 2022
Residual Overfit Method of Exploration
Residual Overfit Method of Exploration
James McInerney
Nathan Kallus
OffRLUQCV
22
0
0
06 Oct 2021
MARS-Gym: A Gym framework to model, train, and evaluate Recommender
  Systems for Marketplaces
MARS-Gym: A Gym framework to model, train, and evaluate Recommender Systems for Marketplaces
Marlesson R. O. Santana
Luckeciano C. Melo
Fernando H. F. Camargo
Bruno Brandão
Anderson Soares
Renan M. Oliveira
Sandor Caetano
OffRL
45
15
0
30 Sep 2020
1