Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
1612.02516
Cited By
Stochastic Primal-Dual Methods and Sample Complexity of Reinforcement Learning
8 December 2016
Yichen Chen
Mengdi Wang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Stochastic Primal-Dual Methods and Sample Complexity of Reinforcement Learning"
31 / 31 papers shown
Title
Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality
Shaocong Ma
Ziyi Chen
Yi Zhou
Heng Huang
OffRL
112
4
0
24 Aug 2025
A Two-Timescale Primal-Dual Framework for Reinforcement Learning via Online Dual Variable Guidance
Axel Friedrich Wolter
Tobias Sutter
OffRL
176
0
0
07 May 2025
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning
Zihao Li
Boyi Liu
Zhuoran Yang
Zhaoran Wang
Mengdi Wang
241
2
0
16 Feb 2024
Robust Markov Decision Processes without Model Estimation
Wenhao Yang
Hanfengzhai Wang
Tadashi Kozuno
S. Jordan
Zhihua Zhang
214
7
0
02 Feb 2023
Lagrangian Method for Q-Function Learning (with Applications to Machine Translation)
International Conference on Machine Learning (ICML), 2022
Bojun Huang
143
1
0
22 Jul 2022
What is a Good Metric to Study Generalization of Minimax Learners?
Neural Information Processing Systems (NeurIPS), 2022
Asuman Ozdaglar
S. Pattathil
Jiawei Zhang
Jianchao Tan
153
15
0
09 Jun 2022
Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation
Adaptive Agents and Multi-Agent Systems (AAMAS), 2022
Jing Dong
Li Shen
Ying Xu
Baoxiang Wang
155
1
0
28 Feb 2022
Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles
Xinglong Zhang
Yaoqian Peng
Biao Luo
Wei Pan
Xin Xu
Haibin Xie
124
18
0
18 Dec 2021
New Versions of Gradient Temporal Difference Learning
IEEE Transactions on Automatic Control (IEEE TAC), 2021
Dong-hwan Lee
Han-Dong Lim
Jihoon Park
Okyong Choi
236
5
0
09 Sep 2021
Learning to Act Safely with Limited Exposure and Almost Sure Certainty
IEEE Transactions on Automatic Control (IEEE TAC), 2021
Agustin Castellano
Hancheng Min
J. Bazerque
Enrique Mallada
188
4
0
18 May 2021
Near Optimal Policy Optimization via REPS
Neural Information Processing Systems (NeurIPS), 2021
Aldo Pacchiano
Jonathan Lee
Peter L. Bartlett
Ofir Nachum
138
3
0
17 Mar 2021
A Provably Efficient Sample Collection Strategy for Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2020
Jean Tarbouriech
Matteo Pirotta
Michal Valko
A. Lazaric
OffRL
180
18
0
13 Jul 2020
Efficient Planning in Large MDPs with Weak Linear Function Approximation
Neural Information Processing Systems (NeurIPS), 2020
R. Shariff
Csaba Szepesvári
130
22
0
13 Jul 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
890
2,300
0
04 May 2020
Cautious Reinforcement Learning via Distributional Risk in the Dual Domain
IEEE Journal on Selected Areas in Information Theory (JSAIT), 2020
Junyu Zhang
Amrit Singh Bedi
Mengdi Wang
Alec Koppel
130
28
0
27 Feb 2020
Global Convergence and Variance-Reduced Optimization for a Class of Nonconvex-Nonconcave Minimax Problems
Junchi Yang
Negar Kiyavash
Niao He
261
86
0
22 Feb 2020
Reinforcement Learning via Fenchel-Rockafellar Duality
Ofir Nachum
Bo Dai
OffRL
301
132
0
07 Jan 2020
AlgaeDICE: Policy Gradient from Arbitrary Experience
Ofir Nachum
Bo Dai
Ilya Kostrikov
Yinlam Chow
Lihong Li
Dale Schuurmans
OffRL
271
254
0
04 Dec 2019
Optimization for Reinforcement Learning: From Single Agent to Cooperative Agents
IEEE Signal Processing Magazine (IEEE SPM), 2019
Dong-hwan Lee
Niao He
Parameswaran Kamalaruban
Volkan Cevher
125
94
0
01 Dec 2019
Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms
Jianchao Tan
Zhuoran Yang
Tamer Basar
487
1,424
0
24 Nov 2019
A Reduction from Reinforcement Learning to No-Regret Online Learning
International Conference on Artificial Intelligence and Statistics (AISTATS), 2019
Ching-An Cheng
Rémi Tachet des Combes
Byron Boots
Geoffrey J. Gordon
OffRL
163
16
0
14 Nov 2019
Doubly Robust Bias Reduction in Infinite Horizon Off-Policy Estimation
International Conference on Learning Representations (ICLR), 2019
Ziyang Tang
Yihao Feng
Lihong Li
Dengyong Zhou
Qiang Liu
OffRL
239
70
0
16 Oct 2019
Voting-Based Multi-Agent Reinforcement Learning for Intelligent IoT
IEEE Internet of Things Journal (IEEE IoT Journal), 2019
Yue Xu
Zengde Deng
Mengdi Wang
Wenjun Xu
Anthony Man-Cho So
Shuguang Cui
142
16
0
02 Jul 2019
Probability Functional Descent: A Unifying Perspective on GANs, Variational Inference, and Reinforcement Learning
International Conference on Machine Learning (ICML), 2019
Casey Chu
Jose H. Blanchet
Peter Glynn
GAN
175
28
0
30 Jan 2019
Privacy-preserving Q-Learning with Functional Noise in Continuous State Spaces
Neural Information Processing Systems (NeurIPS), 2019
Baoxiang Wang
N. Hegde
215
68
0
30 Jan 2019
Multi-Agent Reinforcement Learning via Double Averaging Primal-Dual Optimization
Hoi-To Wai
Zhuoran Yang
Zhaoran Wang
Mingyi Hong
196
181
0
03 Jun 2018
Scalable Bilinear
π
π
π
Learning Using State and Action Features
Yichen Chen
Lihong Li
Mengdi Wang
151
47
0
27 Apr 2018
SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Bo Dai
Albert Eaton Shaw
Lihong Li
Lin Xiao
Niao He
Zhen Liu
Jianshu Chen
Le Song
222
25
0
29 Dec 2017
Boosting the Actor with Dual Critic
International Conference on Learning Representations (ICLR), 2017
Bo Dai
Albert Eaton Shaw
Niao He
Lihong Li
Le Song
130
46
0
29 Dec 2017
Deep Primal-Dual Reinforcement Learning: Accelerating Actor-Critic using Bellman Duality
W. Cho
Mengdi Wang
OffRL
78
14
0
07 Dec 2017
Primal-Dual
π
π
π
Learning: Sample Complexity and Sublinear Run Time for Ergodic Markov Decision Problems
Mengdi Wang
213
71
0
17 Oct 2017
1