1402.0635
Cited By
Generalization and Exploration via Randomized Value Functions
4 February 2014
Ian Osband
Benjamin Van Roy
Zheng Wen
Papers citing "Generalization and Exploration via Randomized Value Functions"
15 / 15 papers shown
Title
Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning
Yongshuai Liu
Xin Liu
107
1
0
26 Mar 2025
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
101
3
0
18 Jul 2024
State of the Art Control of Atari Games Using Shallow Reinforcement Learning
Yitao Liang
Marlos C. Machado
Erik Talvitie
Michael Bowling
38
113
0
04 Dec 2015
Sample Complexity of Episodic Fixed-Horizon Reinforcement Learning
Christoph Dann
Emma Brunskill
34
249
0
29 Oct 2015
Bootstrapped Thompson Sampling and Deep Exploration
Ian Osband
Benjamin Van Roy
59
105
0
01 Jul 2015
Model-based Reinforcement Learning and the Eluder Dimension
Ian Osband
Benjamin Van Roy
52
188
0
07 Jun 2014
Near-optimal Reinforcement Learning in Factored MDPs
Ian Osband
Benjamin Van Roy
54
121
0
15 Mar 2014
The Sample-Complexity of General Reinforcement Learning
Tor Lattimore
Marcus Hutter
P. Sunehag
VLM
41
67
0
22 Aug 2013
(More) Efficient Reinforcement Learning via Posterior Sampling
Ian Osband
Daniel Russo
Benjamin Van Roy
90
529
0
04 Jun 2013
Regret Bounds for Reinforcement Learning with Policy Advice
M. G. Azar
A. Lazaric
Emma Brunskill
51
36
0
05 May 2013
Efficient Reinforcement Learning for High Dimensional Linear Quadratic Systems
M. Ibrahimi
Adel Javanmard
Benjamin Van Roy
56
91
0
24 Mar 2013
Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
R. Ortner
D. Ryabko
OffRL
56
85
0
11 Feb 2013
Learning to Optimize Via Posterior Sampling
Daniel Russo
Benjamin Van Roy
120
697
0
11 Jan 2013
Further Optimal Regret Bounds for Thompson Sampling
Shipra Agrawal
Navin Goyal
64
443
0
15 Sep 2012
REGAL: A Regularization based Algorithm for Reinforcement Learning in Weakly Communicating MDPs
Peter L. Bartlett
Ambuj Tewari
62
280
0
09 May 2012