1806.11500
Cited By
Bayesian Counterfactual Risk Minimization
29 June 2018
Ben London
Ted Sandler
OffRL
Papers citing "Bayesian Counterfactual Risk Minimization"
7 / 7 papers shown
Title
A General Framework for Off-Policy Learning with Partially-Observed Reward
Rikiya Takehi
Masahiro Asami
K. Kawakami
Yuta Saito
OffRL
35
0
0
17 Jun 2025
MultiScale Contextual Bandits for Long Term Objectives
Richa Rastogi
Yuta Saito
Thorsten Joachims
OffRL
82
0
0
22 Mar 2025
Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
Imad Aouali
Victor-Emmanuel Brunel
David Rohde
Anna Korba
OffRL
178
5
0
22 Feb 2024
Fast Offline Policy Optimization for Large Scale Recommendation
Otmane Sakhi
D. Rohde
Alexandre Gilotte
OffRL
90
4
0
08 Aug 2022
PAC-Bayesian Lifelong Learning For Multi-Armed Bandits
H. Flynn
David Reeb
M. Kandemir
Jan Peters
85
7
0
07 Mar 2022
Variance-Optimal Augmentation Logging for Counterfactual Evaluation in Contextual Bandits
Aaron David Tucker
Thorsten Joachims
OffRL
36
9
0
03 Feb 2022
Off-policy Bandits with Deficient Support
Noveen Sachdeva
Yi-Hsun Su
Thorsten Joachims
OffRL
200
78
0
16 Jun 2020