ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1405.6757
  4. Cited By
Proximal Reinforcement Learning: A New Theory of Sequential Decision
  Making in Primal-Dual Spaces

Proximal Reinforcement Learning: A New Theory of Sequential Decision Making in Primal-Dual Spaces

26 May 2014
Sridhar Mahadevan
Bo Liu
Philip S. Thomas
Will Dabney
S. Giguere
Nicholas Jacek
I. Gemp
Ji Liu
ArXiv (abs)PDFHTML

Papers citing "Proximal Reinforcement Learning: A New Theory of Sequential Decision Making in Primal-Dual Spaces"

24 / 24 papers shown
Title
Proximal Bellman mappings for reinforcement learning and their
  application to robust adaptive filtering
Proximal Bellman mappings for reinforcement learning and their application to robust adaptive filtering
Yuki Akiyama
Konstantinos Slavakis
40
1
0
14 Sep 2023
Toward Efficient Gradient-Based Value Estimation
Toward Efficient Gradient-Based Value Estimation
Arsalan Sharifnassab
R. Sutton
58
3
0
31 Jan 2023
Regularized Q-learning
Regularized Q-learning
Han-Dong Lim
Donghwan Lee
67
11
0
11 Feb 2022
An Empirical Comparison of Off-policy Prediction Learning Algorithms in
  the Four Rooms Environment
An Empirical Comparison of Off-policy Prediction Learning Algorithms in the Four Rooms Environment
Sina Ghiassian
R. Sutton
AAMLOffRL
87
6
0
10 Sep 2021
An Empirical Comparison of Off-policy Prediction Learning Algorithms on
  the Collision Task
An Empirical Comparison of Off-policy Prediction Learning Algorithms on the Collision Task
Sina Ghiassian
R. Sutton
AAMLOffRL
78
5
0
02 Jun 2021
Gradient Temporal-Difference Learning with Regularized Corrections
Gradient Temporal-Difference Learning with Regularized Corrections
Sina Ghiassian
Andrew Patterson
Shivam Garg
Dhawal Gupta
Adam White
Martha White
172
42
0
01 Jul 2020
Borrowing From the Future: Addressing Double Sampling in Model-free
  Control
Borrowing From the Future: Addressing Double Sampling in Model-free Control
Yuhua Zhu
Zachary Izzo
Lexing Ying
22
4
0
11 Jun 2020
Finite-Sample Analysis of Proximal Gradient TD Algorithms
Finite-Sample Analysis of Proximal Gradient TD Algorithms
Bo Liu
Ji Liu
Mohammad Ghavamzadeh
Sridhar Mahadevan
Marek Petrik
70
158
0
06 Jun 2020
Optimization for Reinforcement Learning: From Single Agent to
  Cooperative Agents
Optimization for Reinforcement Learning: From Single Agent to Cooperative Agents
Dong-hwan Lee
Niao He
Parameswaran Kamalaruban
Volkan Cevher
52
89
0
01 Dec 2019
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function
  Approximation
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation
Shangtong Zhang
Bo Liu
Hengshuai Yao
Shimon Whiteson
OffRL
104
8
0
11 Nov 2019
Zap Q-Learning With Nonlinear Function Approximation
Zap Q-Learning With Nonlinear Function Approximation
Shuhang Chen
Adithya M. Devraj
Fan Lu
Ana Bušić
Sean P. Meyn
64
20
0
11 Oct 2019
A generalization of regularized dual averaging and its dynamics
A generalization of regularized dual averaging and its dynamics
Shih-Kang Chao
Guang Cheng
51
18
0
22 Sep 2019
Entropic Regularization of Markov Decision Processes
Entropic Regularization of Markov Decision Processes
Boris Belousov
Jan Peters
73
24
0
06 Jul 2019
Target-Based Temporal Difference Learning
Target-Based Temporal Difference Learning
Donghwan Lee
Niao He
OOD
75
31
0
24 Apr 2019
Scalable Bilinear $π$ Learning Using State and Action Features
Scalable Bilinear πππ Learning Using State and Action Features
Yichen Chen
Lihong Li
Mengdi Wang
83
46
0
27 Apr 2018
Barrier-Certified Adaptive Reinforcement Learning with Applications to
  Brushbot Navigation
Barrier-Certified Adaptive Reinforcement Learning with Applications to Brushbot Navigation
Motoya Ohnishi
Li Wang
Gennaro Notomista
M. Egerstedt
88
69
0
29 Jan 2018
SBEED: Convergent Reinforcement Learning with Nonlinear Function
  Approximation
SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Bo Dai
Albert Eaton Shaw
Lihong Li
Lin Xiao
Niao He
Zhen Liu
Jianshu Chen
Le Song
87
25
0
29 Dec 2017
On Convergence of some Gradient-based Temporal-Differences Algorithms
  for Off-Policy Learning
On Convergence of some Gradient-based Temporal-Differences Algorithms for Off-Policy Learning
Huizhen Yu
OffRL
99
32
0
27 Dec 2017
On Generalized Bellman Equations and Temporal-Difference Learning
On Generalized Bellman Equations and Temporal-Difference Learning
Huizhen Yu
A. R. Mahmood
R. Sutton
118
29
0
14 Apr 2017
Stochastic Primal-Dual Methods and Sample Complexity of Reinforcement
  Learning
Stochastic Primal-Dual Methods and Sample Complexity of Reinforcement Learning
Yichen Chen
Mengdi Wang
90
64
0
08 Dec 2016
Accelerated Gradient Temporal Difference Learning
Accelerated Gradient Temporal Difference Learning
Yangchen Pan
Adam White
Martha White
40
27
0
28 Nov 2016
Investigating practical linear temporal difference learning
Investigating practical linear temporal difference learning
Adam White
Martha White
OffRL
91
41
0
28 Feb 2016
Weak Convergence Properties of Constrained Emphatic Temporal-difference
  Learning with Constant and Slowly Diminishing Stepsize
Weak Convergence Properties of Constrained Emphatic Temporal-difference Learning with Constant and Slowly Diminishing Stepsize
Huizhen Yu
79
30
0
23 Nov 2015
Playing with Duality: An Overview of Recent Primal-Dual Approaches for
  Solving Large-Scale Optimization Problems
Playing with Duality: An Overview of Recent Primal-Dual Approaches for Solving Large-Scale Optimization Problems
N. Komodakis
J. Pesquet
117
398
0
20 Jun 2014
1