Effective Evaluation using Logged Bandit Feedback from Multiple Loggers

Effective Evaluation using Logged Bandit Feedback from Multiple Loggers

17 March 2017

Tobias Schnabel

Thorsten Joachims

Papers citing "Effective Evaluation using Logged Bandit Feedback from Multiple Loggers"

12 / 12 papers shown

Title
Open Bandit Dataset and Pipeline: Towards Realistic and Reproducible Off-Policy Evaluation Yuta Saito Shunsuke Aihara Megumi Matsutani Yusuke Narita OffRL 122 75 0 17 Aug 2020
Unbiased Learning-to-Rank with Biased Feedback Thorsten Joachims Adith Swaminathan Tobias Schnabel CML 70 538 0 16 Aug 2016
Unbiased Comparative Evaluation of Ranking Functions Tobias Schnabel Adith Swaminathan P. Frazier Thorsten Joachims 36 27 0 25 Apr 2016
Generalized Multiple Importance Sampling Victor Elvira Luca Martino D. Luengo M. Bugallo 41 144 0 10 Nov 2015
Efficient Multiple Importance Sampling Estimators Victor Elvira Luca Martino D. Luengo M. Bugallo 46 75 0 20 May 2015
Optimal mixture weights in multiple importance sampling Hera Y. He Art B. Owen 60 29 0 14 Nov 2014
Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits Alekh Agarwal Daniel J. Hsu Satyen Kale John Langford Lihong Li Robert Schapire OffRL 223 504 0 04 Feb 2014
Counterfactual Reasoning and Learning Systems Léon Bottou J. Peters J. Q. Candela Denis Xavier Charles D. M. Chickering Elon Portugaly Dipankar Ray Patrice Y. Simard Edward Snelson CML OffRL 212 781 0 11 Sep 2012
Unbiased Offline Evaluation of Contextual-bandit-based News Article Recommendation Algorithms Lihong Li Wei Chu John Langford Xuanhui Wang OffRL 166 574 0 31 Mar 2010
A Contextual-Bandit Approach to Personalized News Article Recommendation Lihong Li Wei Chu John Langford Robert Schapire 307 2,935 0 28 Feb 2010
Learning from Logged Implicit Exploration Data Alexander L. Strehl John Langford Sham Kakade Lihong Li OffRL 121 254 0 27 Feb 2010
Adaptive Multiple Importance Sampling J. Cornuet Jean-Michel Marin Antonietta Mira Christian P. Robert 71 263 0 07 Jul 2009