Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1405.6757
Cited By
Proximal Reinforcement Learning: A New Theory of Sequential Decision Making in Primal-Dual Spaces
26 May 2014
Sridhar Mahadevan
Bo Liu
Philip S. Thomas
Will Dabney
S. Giguere
Nicholas Jacek
I. Gemp
Ji Liu
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Proximal Reinforcement Learning: A New Theory of Sequential Decision Making in Primal-Dual Spaces"
24 / 24 papers shown
Title
Proximal Bellman mappings for reinforcement learning and their application to robust adaptive filtering
Yuki Akiyama
Konstantinos Slavakis
40
1
0
14 Sep 2023
Toward Efficient Gradient-Based Value Estimation
Arsalan Sharifnassab
R. Sutton
58
3
0
31 Jan 2023
Regularized Q-learning
Han-Dong Lim
Donghwan Lee
67
11
0
11 Feb 2022
An Empirical Comparison of Off-policy Prediction Learning Algorithms in the Four Rooms Environment
Sina Ghiassian
R. Sutton
AAML
OffRL
87
6
0
10 Sep 2021
An Empirical Comparison of Off-policy Prediction Learning Algorithms on the Collision Task
Sina Ghiassian
R. Sutton
AAML
OffRL
78
5
0
02 Jun 2021
Gradient Temporal-Difference Learning with Regularized Corrections
Sina Ghiassian
Andrew Patterson
Shivam Garg
Dhawal Gupta
Adam White
Martha White
172
42
0
01 Jul 2020
Borrowing From the Future: Addressing Double Sampling in Model-free Control
Yuhua Zhu
Zachary Izzo
Lexing Ying
22
4
0
11 Jun 2020
Finite-Sample Analysis of Proximal Gradient TD Algorithms
Bo Liu
Ji Liu
Mohammad Ghavamzadeh
Sridhar Mahadevan
Marek Petrik
70
158
0
06 Jun 2020
Optimization for Reinforcement Learning: From Single Agent to Cooperative Agents
Dong-hwan Lee
Niao He
Parameswaran Kamalaruban
Volkan Cevher
52
89
0
01 Dec 2019
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation
Shangtong Zhang
Bo Liu
Hengshuai Yao
Shimon Whiteson
OffRL
104
8
0
11 Nov 2019
Zap Q-Learning With Nonlinear Function Approximation
Shuhang Chen
Adithya M. Devraj
Fan Lu
Ana Bušić
Sean P. Meyn
64
20
0
11 Oct 2019
A generalization of regularized dual averaging and its dynamics
Shih-Kang Chao
Guang Cheng
51
18
0
22 Sep 2019
Entropic Regularization of Markov Decision Processes
Boris Belousov
Jan Peters
73
24
0
06 Jul 2019
Target-Based Temporal Difference Learning
Donghwan Lee
Niao He
OOD
75
31
0
24 Apr 2019
Scalable Bilinear
π
π
π
Learning Using State and Action Features
Yichen Chen
Lihong Li
Mengdi Wang
83
46
0
27 Apr 2018
Barrier-Certified Adaptive Reinforcement Learning with Applications to Brushbot Navigation
Motoya Ohnishi
Li Wang
Gennaro Notomista
M. Egerstedt
88
69
0
29 Jan 2018
SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Bo Dai
Albert Eaton Shaw
Lihong Li
Lin Xiao
Niao He
Zhen Liu
Jianshu Chen
Le Song
87
25
0
29 Dec 2017
On Convergence of some Gradient-based Temporal-Differences Algorithms for Off-Policy Learning
Huizhen Yu
OffRL
99
32
0
27 Dec 2017
On Generalized Bellman Equations and Temporal-Difference Learning
Huizhen Yu
A. R. Mahmood
R. Sutton
118
29
0
14 Apr 2017
Stochastic Primal-Dual Methods and Sample Complexity of Reinforcement Learning
Yichen Chen
Mengdi Wang
90
64
0
08 Dec 2016
Accelerated Gradient Temporal Difference Learning
Yangchen Pan
Adam White
Martha White
40
27
0
28 Nov 2016
Investigating practical linear temporal difference learning
Adam White
Martha White
OffRL
91
41
0
28 Feb 2016
Weak Convergence Properties of Constrained Emphatic Temporal-difference Learning with Constant and Slowly Diminishing Stepsize
Huizhen Yu
79
30
0
23 Nov 2015
Playing with Duality: An Overview of Recent Primal-Dual Approaches for Solving Large-Scale Optimization Problems
N. Komodakis
J. Pesquet
117
398
0
20 Jun 2014
1