ResearchTrend.AI
arXiv: 1712.10282
Boosting the Actor with Dual Critic

29 December 2017
Bo Dai
Albert Eaton Shaw
Niao He
Lihong Li
Le Song

Papers citing "Boosting the Actor with Dual Critic"

12 / 12 papers shown
A Two-Timescale Primal-Dual Framework for Reinforcement Learning via Online Dual Variable Guidance
Axel Friedrich Wolter
Tobias Sutter
OffRL
37
0
0
07 May 2025
A Dual Perspective of Reinforcement Learning for Imposing Policy Constraints
Bram De Cooman
Johan A. K. Suykens
33
0
0
25 Apr 2024
The Landscape of the Proximal Point Method for Nonconvex-Nonconcave Minimax Optimization
Benjamin Grimmer
Haihao Lu
Pratik Worah
Vahab Mirrokni
37
9
0
15 Jun 2020
Black-box Off-policy Estimation for Infinite-Horizon Reinforcement Learning
Ali Mousavi
Lihong Li
Qiang Liu
Denny Zhou
OffRL
11
32
0
24 Mar 2020
Reinforcement Learning via Fenchel-Rockafellar Duality
Ofir Nachum
Bo Dai
OffRL
11
117
0
07 Jan 2020
On the Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost
Zhuoran Yang
Yongxin Chen
Mingyi Hong
Zhaoran Wang
26
39
0
14 Jul 2019
A Kernel Loss for Solving the Bellman Equation
Yihao Feng
Lihong Li
Qiang Liu
22
70
0
25 May 2019
On the Global Convergence of Imitation Learning: A Case for Linear Quadratic Regulator
Qi Cai
Mingyi Hong
Yongxin Chen
Zhaoran Wang
19
34
0
11 Jan 2019
TD-Regularized Actor-Critic Methods
Simone Parisi
Voot Tangkaratt
Jan Peters
Mohammad Emtiyaz Khan
OffRL
14
31
0
19 Dec 2018
Neural Approaches to Conversational AI
Jianfeng Gao
Michel Galley
Lihong Li
37
668
0
21 Sep 2018
Multi-Agent Reinforcement Learning via Double Averaging Primal-Dual Optimization
Hoi-To Wai
Zhuoran Yang
Zhaoran Wang
Mingyi Hong
27
169
0
03 Jun 2018
SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Bo Dai
Albert Eaton Shaw
Lihong Li
Lin Xiao
Niao He
Zhen Liu
Jianshu Chen
Le Song
24
25
0
29 Dec 2017