Generalization and Exploration via Randomized Value Functions

4 February 2014
Ian Osband
Benjamin Van Roy
Zheng Wen

Papers citing "Generalization and Exploration via Randomized Value Functions"

15 / 15 papers shown
Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning
Yongshuai Liu
Xin Liu
107
1
0
26 Mar 2025
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
101
3
0
18 Jul 2024
State of the Art Control of Atari Games Using Shallow Reinforcement Learning
Yitao Liang
Marlos C. Machado
Erik Talvitie
Michael Bowling
38
113
0
04 Dec 2015
Sample Complexity of Episodic Fixed-Horizon Reinforcement Learning
Christoph Dann
Emma Brunskill
34
249
0
29 Oct 2015
Bootstrapped Thompson Sampling and Deep Exploration
Ian Osband
Benjamin Van Roy
59
105
0
01 Jul 2015
Model-based Reinforcement Learning and the Eluder Dimension
Ian Osband
Benjamin Van Roy
52
188
0
07 Jun 2014
Near-optimal Reinforcement Learning in Factored MDPs
Ian Osband
Benjamin Van Roy
54
121
0
15 Mar 2014
The Sample-Complexity of General Reinforcement Learning
Tor Lattimore
Marcus Hutter
P. Sunehag
VLM
41
67
0
22 Aug 2013
(More) Efficient Reinforcement Learning via Posterior Sampling
Ian Osband
Daniel Russo
Benjamin Van Roy
90
529
0
04 Jun 2013
Regret Bounds for Reinforcement Learning with Policy Advice
M. G. Azar
A. Lazaric
Emma Brunskill
51
36
0
05 May 2013
Efficient Reinforcement Learning for High Dimensional Linear Quadratic Systems
M. Ibrahimi
Adel Javanmard
Benjamin Van Roy
56
91
0
24 Mar 2013
Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
R. Ortner
D. Ryabko
OffRL
56
85
0
11 Feb 2013
Learning to Optimize Via Posterior Sampling
Daniel Russo
Benjamin Van Roy
120
697
0
11 Jan 2013
Further Optimal Regret Bounds for Thompson Sampling
Shipra Agrawal
Navin Goyal
64
443
0
15 Sep 2012
REGAL: A Regularization based Algorithm for Reinforcement Learning in Weakly Communicating MDPs
Peter L. Bartlett
Ambuj Tewari
62
280
0
09 May 2012