ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.11074
  4. Cited By
Reward Constrained Policy Optimization

Reward Constrained Policy Optimization

28 May 2018
Chen Tessler
D. Mankowitz
Shie Mannor
ArXivPDFHTML

Papers citing "Reward Constrained Policy Optimization"

28 / 128 papers shown
Title
Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement
  Learning via Frank-Wolfe Policy Optimization
Escaping from Zero Gradient: Revisiting Action-Constrained Reinforcement Learning via Frank-Wolfe Policy Optimization
Jyun-Li Lin
Wei-Ting Hung
Shangtong Yang
Ping-Chun Hsieh
Xi Liu
32
14
0
22 Feb 2021
Factored Policy Gradients: Leveraging Structure for Efficient Learning
  in MOMDPs
Factored Policy Gradients: Leveraging Structure for Efficient Learning in MOMDPs
Thomas Spooner
N. Vadori
Sumitra Ganesh
30
7
0
20 Feb 2021
Reinforcement Learning for Datacenter Congestion Control
Reinforcement Learning for Datacenter Congestion Control
Chen Tessler
Yuval Shpigelman
Gal Dalal
Amit Mandelbaum
Doron Haritan Kazakov
Benjamin Fuhrer
Gal Chechik
Shie Mannor
34
32
0
18 Feb 2021
Separated Proportional-Integral Lagrangian for Chance Constrained
  Reinforcement Learning
Separated Proportional-Integral Lagrangian for Chance Constrained Reinforcement Learning
Baiyu Peng
Yao Mu
Jingliang Duan
Yang Guan
Shengbo Eben Li
Jianyu Chen
53
19
0
17 Feb 2021
Inverse Constrained Reinforcement Learning
Inverse Constrained Reinforcement Learning
Usman Anwar
Shehryar Malik
Alireza Aghasi
Ali Ahmed
18
58
0
19 Nov 2020
CRPO: A New Approach for Safe Reinforcement Learning with Convergence
  Guarantee
CRPO: A New Approach for Safe Reinforcement Learning with Convergence Guarantee
Tengyu Xu
Yingbin Liang
Guanghui Lan
52
122
0
11 Nov 2020
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping
Yujing Hu
Weixun Wang
Hangtian Jia
Yixiang Wang
Yingfeng Chen
Jianye Hao
Feng Wu
Changjie Fan
OffRL
22
173
0
05 Nov 2020
Learning to be Safe: Deep RL with a Safety Critic
Learning to be Safe: Deep RL with a Safety Critic
K. Srinivasan
Benjamin Eysenbach
Sehoon Ha
Jie Tan
Chelsea Finn
OffRL
38
143
0
27 Oct 2020
Robust Constrained Reinforcement Learning for Continuous Control with
  Model Misspecification
Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification
D. Mankowitz
D. A. Calian
Rae Jeong
Cosmin Paduraru
N. Heess
Sumanth Dathathri
Martin Riedmiller
Timothy A. Mann
26
12
0
20 Oct 2020
Constrained Markov Decision Processes via Backward Value Functions
Constrained Markov Decision Processes via Backward Value Functions
Harsh Satija
Philip Amortila
Joelle Pineau
40
51
0
26 Aug 2020
Learning with Safety Constraints: Sample Complexity of Reinforcement
  Learning for Constrained MDPs
Learning with Safety Constraints: Sample Complexity of Reinforcement Learning for Constrained MDPs
Aria HasanzadeZonuzy
Archana Bura
D. Kalathil
S. Shakkottai
32
38
0
01 Aug 2020
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Adam Stooke
Joshua Achiam
Pieter Abbeel
31
287
0
08 Jul 2020
An empirical investigation of the challenges of real-world reinforcement
  learning
An empirical investigation of the challenges of real-world reinforcement learning
Gabriel Dulac-Arnold
Nir Levine
D. Mankowitz
Jerry Li
Cosmin Paduraru
Sven Gowal
Todd Hester
OffRL
34
121
0
24 Mar 2020
Exploration-Exploitation in Constrained MDPs
Exploration-Exploitation in Constrained MDPs
Yonathan Efroni
Shie Mannor
Matteo Pirotta
33
171
0
04 Mar 2020
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization
Provably Efficient Safe Exploration via Primal-Dual Policy Optimization
Dongsheng Ding
Xiaohan Wei
Zhuoran Yang
Zhaoran Wang
M. Jovanović
25
159
0
01 Mar 2020
Learning in Markov Decision Processes under Constraints
Learning in Markov Decision Processes under Constraints
Rahul Singh
Abhishek Gupta
Ness B. Shroff
51
27
0
27 Feb 2020
Reward-Free Exploration for Reinforcement Learning
Reward-Free Exploration for Reinforcement Learning
Chi Jin
A. Krishnamurthy
Max Simchowitz
Tiancheng Yu
OffRL
112
194
0
07 Feb 2020
Learning Representations in Reinforcement Learning:An Information
  Bottleneck Approach
Learning Representations in Reinforcement Learning:An Information Bottleneck Approach
Yingjun Pei
Xinwen Hou
SSL
39
10
0
12 Nov 2019
Convergent Policy Optimization for Safe Reinforcement Learning
Convergent Policy Optimization for Safe Reinforcement Learning
Ming Yu
Zhuoran Yang
Mladen Kolar
Zhaoran Wang
16
91
0
26 Oct 2019
IPO: Interior-point Policy Optimization under Constraints
IPO: Interior-point Policy Optimization under Constraints
Yongshuai Liu
J. Ding
Xin Liu
24
176
0
21 Oct 2019
Neural Simplex Architecture
Neural Simplex Architecture
Dung Phan
Radu Grosu
N. Jansen
Nicola Paoletti
S. Smolka
Scott D. Stoller
22
61
0
01 Aug 2019
Reinforcement Learning with Convex Constraints
Reinforcement Learning with Convex Constraints
Sobhan Miryoosefi
Kianté Brantley
Hal Daumé
Miroslav Dudík
Robert Schapire
17
90
0
21 Jun 2019
A Bayesian Approach to Robust Reinforcement Learning
A Bayesian Approach to Robust Reinforcement Learning
E. Derman
D. Mankowitz
Timothy A. Mann
Shie Mannor
21
58
0
20 May 2019
Learning Gentle Object Manipulation with Curiosity-Driven Deep
  Reinforcement Learning
Learning Gentle Object Manipulation with Curiosity-Driven Deep Reinforcement Learning
Sandy H. Huang
Martina Zambelli
Jackie Kay
M. Martins
Yuval Tassa
P. Pilarski
R. Hadsell
28
50
0
20 Mar 2019
Value constrained model-free continuous control
Value constrained model-free continuous control
Steven Bohez
A. Abdolmaleki
Michael Neunert
J. Buchli
N. Heess
R. Hadsell
24
62
0
12 Feb 2019
Learning to Walk via Deep Reinforcement Learning
Learning to Walk via Deep Reinforcement Learning
Tuomas Haarnoja
Sehoon Ha
Aurick Zhou
Jie Tan
George Tucker
Sergey Levine
54
433
0
26 Dec 2018
First-order Methods Almost Always Avoid Saddle Points
First-order Methods Almost Always Avoid Saddle Points
Jason D. Lee
Ioannis Panageas
Georgios Piliouras
Max Simchowitz
Michael I. Jordan
Benjamin Recht
ODL
95
83
0
20 Oct 2017
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach
Yinlam Chow
Aviv Tamar
Shie Mannor
Marco Pavone
73
314
0
06 Jun 2015
Previous
123