Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1210.4862
Cited By
Sample-efficient Nonstationary Policy Evaluation for Contextual Bandits
16 October 2012
Miroslav Dudík
D. Erhan
John Langford
Lihong Li
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Sample-efficient Nonstationary Policy Evaluation for Contextual Bandits"
3 / 3 papers shown
Title
Second Order Bounds for Contextual Bandits with Function Approximation
Aldo Pacchiano
168
4
0
24 Sep 2024
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Imad Aouali
Victor-Emmanuel Brunel
David Rohde
Anna Korba
OffRL
106
5
0
22 Feb 2024
Unbiased Offline Evaluation of Contextual-bandit-based News Article Recommendation Algorithms
Lihong Li
Wei Chu
John Langford
Xuanhui Wang
OffRL
168
574
0
31 Mar 2010
1