ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1703.06180
  4. Cited By
Effective Evaluation using Logged Bandit Feedback from Multiple Loggers

Effective Evaluation using Logged Bandit Feedback from Multiple Loggers

17 March 2017
Aman Agarwal
Soumya Basu
Tobias Schnabel
Thorsten Joachims
    OffRL
ArXivPDFHTML

Papers citing "Effective Evaluation using Logged Bandit Feedback from Multiple Loggers"

12 / 12 papers shown
Title
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible
  Off-Policy Evaluation
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation
Yuta Saito
Shunsuke Aihara
Megumi Matsutani
Yusuke Narita
OffRL
122
75
0
17 Aug 2020
Unbiased Learning-to-Rank with Biased Feedback
Unbiased Learning-to-Rank with Biased Feedback
Thorsten Joachims
Adith Swaminathan
Tobias Schnabel
CML
70
538
0
16 Aug 2016
Unbiased Comparative Evaluation of Ranking Functions
Unbiased Comparative Evaluation of Ranking Functions
Tobias Schnabel
Adith Swaminathan
P. Frazier
Thorsten Joachims
36
27
0
25 Apr 2016
Generalized Multiple Importance Sampling
Generalized Multiple Importance Sampling
Victor Elvira
Luca Martino
D. Luengo
M. Bugallo
41
144
0
10 Nov 2015
Efficient Multiple Importance Sampling Estimators
Efficient Multiple Importance Sampling Estimators
Victor Elvira
Luca Martino
D. Luengo
M. Bugallo
46
75
0
20 May 2015
Optimal mixture weights in multiple importance sampling
Optimal mixture weights in multiple importance sampling
Hera Y. He
Art B. Owen
60
29
0
14 Nov 2014
Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits
Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits
Alekh Agarwal
Daniel J. Hsu
Satyen Kale
John Langford
Lihong Li
Robert Schapire
OffRL
223
504
0
04 Feb 2014
Counterfactual Reasoning and Learning Systems
Counterfactual Reasoning and Learning Systems
Léon Bottou
J. Peters
J. Q. Candela
Denis Xavier Charles
D. M. Chickering
Elon Portugaly
Dipankar Ray
Patrice Y. Simard
Edward Snelson
CML
OffRL
212
781
0
11 Sep 2012
Unbiased Offline Evaluation of Contextual-bandit-based News Article
  Recommendation Algorithms
Unbiased Offline Evaluation of Contextual-bandit-based News Article Recommendation Algorithms
Lihong Li
Wei Chu
John Langford
Xuanhui Wang
OffRL
166
574
0
31 Mar 2010
A Contextual-Bandit Approach to Personalized News Article Recommendation
A Contextual-Bandit Approach to Personalized News Article Recommendation
Lihong Li
Wei Chu
John Langford
Robert Schapire
307
2,935
0
28 Feb 2010
Learning from Logged Implicit Exploration Data
Learning from Logged Implicit Exploration Data
Alexander L. Strehl
John Langford
Sham Kakade
Lihong Li
OffRL
121
254
0
27 Feb 2010
Adaptive Multiple Importance Sampling
Adaptive Multiple Importance Sampling
J. Cornuet
Jean-Michel Marin
Antonietta Mira
Christian P. Robert
71
263
0
07 Jul 2009
1