ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.04241
  4. Cited By
On Optimistic versus Randomized Exploration in Reinforcement Learning

On Optimistic versus Randomized Exploration in Reinforcement Learning

13 June 2017
Ian Osband
Benjamin Van Roy
ArXiv (abs)PDFHTML

Papers citing "On Optimistic versus Randomized Exploration in Reinforcement Learning"

6 / 6 papers shown
Matrix games with bandit feedback
Matrix games with bandit feedback
Brendan O'Donoghue
Tor Lattimore
Ian Osband
133
12
0
09 Jun 2020
Seamlessly Unifying Attributes and Items: Conversational Recommendation
  for Cold-Start Users
Seamlessly Unifying Attributes and Items: Conversational Recommendation for Cold-Start Users
Shijun Li
Wenqiang Lei
Qingyun Wu
Xiangnan He
Peng Jiang
Tat-Seng Chua
525
133
0
23 May 2020
Time Adaptive Reinforcement Learning
Time Adaptive Reinforcement Learning
Chris Reinke
79
1
0
18 Apr 2020
Personalized HeartSteps: A Reinforcement Learning Algorithm for
  Optimizing Physical Activity
Personalized HeartSteps: A Reinforcement Learning Algorithm for Optimizing Physical ActivityProceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies (IMWUT), 2019
Peng Liao
Kristjan Greenewald
P. Klasnja
Susan Murphy
132
95
0
08 Sep 2019
Scalable Coordinated Exploration in Concurrent Reinforcement Learning
Scalable Coordinated Exploration in Concurrent Reinforcement Learning
Maria Dimakopoulou
Ian Osband
Benjamin Van Roy
OffRL
159
25
0
23 May 2018
Coordinated Exploration in Concurrent Reinforcement Learning
Coordinated Exploration in Concurrent Reinforcement Learning
Maria Dimakopoulou
Benjamin Van Roy
217
42
0
05 Feb 2018
1