Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2008.03959
Cited By
v1
v2
v3
v4 (latest)
Lenient Regret for Multi-Armed Bandits
10 August 2020
Nadav Merlis
Shie Mannor
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Lenient Regret for Multi-Armed Bandits"
1 / 1 papers shown
Title
Gap-Dependent Unsupervised Exploration for Reinforcement Learning
Jingfeng Wu
Vladimir Braverman
Lin F. Yang
78
12
0
11 Aug 2021
1