Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2006.05094
Cited By
Meta-Learning Bandit Policies by Gradient Ascent
9 June 2020
B. Kveton
Martin Mladenov
Chih-Wei Hsu
Manzil Zaheer
Csaba Szepesvári
Craig Boutilier
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Meta-Learning Bandit Policies by Gradient Ascent"
5 / 5 papers shown
Title
Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms
Javad Azizi
T. Duong
Yasin Abbasi-Yadkori
András Gyorgy
Claire Vernade
Mohammad Ghavamzadeh
26
8
0
25 Feb 2022
Meta-Learning for Simple Regret Minimization
Javad Azizi
B. Kveton
Mohammad Ghavamzadeh
S. Katariya
16
10
0
25 Feb 2022
Meta-Thompson Sampling
B. Kveton
Mikhail Konobeev
Manzil Zaheer
Chih-Wei Hsu
Martin Mladenov
Craig Boutilier
Csaba Szepesvári
40
61
0
11 Feb 2021
Probabilistic Model-Agnostic Meta-Learning
Chelsea Finn
Kelvin Xu
Sergey Levine
BDL
165
666
0
07 Jun 2018
Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
Chelsea Finn
Pieter Abbeel
Sergey Levine
OOD
281
11,681
0
09 Mar 2017
1