ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1906.09323
  4. Cited By
Reinforcement Learning with Convex Constraints

Reinforcement Learning with Convex Constraints

21 June 2019
Sobhan Miryoosefi
Kianté Brantley
Hal Daumé
Miroslav Dudík
Robert Schapire
ArXivPDFHTML

Papers citing "Reinforcement Learning with Convex Constraints"

23 / 23 papers shown
Title
Constrained Online Decision-Making: A Unified Framework
Constrained Online Decision-Making: A Unified Framework
Haichen Hu
David Simchi-Levi
Navid Azizan
39
0
0
11 May 2025
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form
Toshinori Kitamura
Tadashi Kozuno
Wataru Kumagai
Kenta Hoshino
Y. Hosoe
Kazumi Kasaura
Masashi Hamaya
Paavo Parmas
Yutaka Matsuo
74
1
0
29 Aug 2024
Non-maximizing policies that fulfill multi-criterion aspirations in expectation
Non-maximizing policies that fulfill multi-criterion aspirations in expectation
Simon Dima
Simon Fischer
J. Heitzig
Joss Oliver
28
1
0
08 Aug 2024
Rate-Preserving Reductions for Blackwell Approachability
Rate-Preserving Reductions for Blackwell Approachability
Christoph Dann
Yishay Mansour
M. Mohri
Jon Schneider
Balasubramanian Sivan
50
2
0
10 Jun 2024
A Dual Perspective of Reinforcement Learning for Imposing Policy Constraints
A Dual Perspective of Reinforcement Learning for Imposing Policy Constraints
Bram De Cooman
Johan A. K. Suykens
43
0
0
25 Apr 2024
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with
  Uniform PAC Guarantees
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees
Toshinori Kitamura
Tadashi Kozuno
Masahiro Kato
Yuki Ichihara
Soichiro Nishimori
Akiyoshi Sannai
Sho Sonoda
Wataru Kumagai
Yutaka Matsuo
49
2
0
31 Jan 2024
Pseudonorm Approachability and Applications to Regret Minimization
Pseudonorm Approachability and Applications to Regret Minimization
Christoph Dann
Yishay Mansour
M. Mohri
Jon Schneider
Balasubramanian Sivan
39
5
0
03 Feb 2023
Towards Artificial Virtuous Agents: Games, Dilemmas and Machine Learning
Ajay Vishwanath
E. Bøhn
Ole-Christoffer Granmo
Charl Maree
C. Omlin
AI4CE
35
5
0
30 Aug 2022
A Review of Safe Reinforcement Learning: Methods, Theory and
  Applications
A Review of Safe Reinforcement Learning: Methods, Theory and Applications
Shangding Gu
Longyu Yang
Yali Du
Guang Chen
Florian Walter
Jun Wang
Alois C. Knoll
OffRL
AI4TS
117
241
0
20 May 2022
Reinforcement Learning with Intrinsic Affinity for Personalized
  Prosperity Management
Reinforcement Learning with Intrinsic Affinity for Personalized Prosperity Management
Charl Maree
C. Omlin
40
1
0
20 Apr 2022
An Optical Control Environment for Benchmarking Reinforcement Learning
  Algorithms
An Optical Control Environment for Benchmarking Reinforcement Learning Algorithms
Abulikemu Abuduweili
Changliu Liu
24
1
0
23 Mar 2022
Challenging Common Assumptions in Convex Reinforcement Learning
Challenging Common Assumptions in Convex Reinforcement Learning
Mirco Mutti
Ric De Santi
Piersilvio De Bartolomeis
Marcello Restelli
OffRL
37
21
0
03 Feb 2022
Reinforcement Learning Your Way: Agent Characterization through Policy
  Regularization
Reinforcement Learning Your Way: Agent Characterization through Policy Regularization
Charl Maree
C. Omlin
27
8
0
21 Jan 2022
Efficient Performance Bounds for Primal-Dual Reinforcement Learning from
  Demonstrations
Efficient Performance Bounds for Primal-Dual Reinforcement Learning from Demonstrations
Angeliki Kamoutsi
G. Banjac
John Lygeros
OffRL
31
7
0
28 Dec 2021
Deep Reinforcement Learning for Demand Driven Services in Logistics and
  Transportation Systems: A Survey
Deep Reinforcement Learning for Demand Driven Services in Logistics and Transportation Systems: A Survey
Zefang Zong
Tao Feng
Tong Xia
Depeng Jin
Yong Li
27
3
0
10 Aug 2021
Constraints Penalized Q-learning for Safe Offline Reinforcement Learning
Constraints Penalized Q-learning for Safe Offline Reinforcement Learning
Haoran Xu
Xianyuan Zhan
Xiangyu Zhu
OffRL
16
86
0
19 Jul 2021
A Simple Reward-free Approach to Constrained Reinforcement Learning
A Simple Reward-free Approach to Constrained Reinforcement Learning
Sobhan Miryoosefi
Chi Jin
16
29
0
12 Jul 2021
Preference learning along multiple criteria: A game-theoretic
  perspective
Preference learning along multiple criteria: A game-theoretic perspective
Kush S. Bhatia
A. Pananjady
Peter L. Bartlett
Anca Dragan
Martin J. Wainwright
40
13
0
05 May 2021
Interactive Learning from Activity Description
Interactive Learning from Activity Description
Khanh Nguyen
Dipendra Kumar Misra
Robert Schapire
Miroslav Dudík
Patrick Shafto
52
34
0
13 Feb 2021
Provably Efficient Algorithms for Multi-Objective Competitive RL
Provably Efficient Algorithms for Multi-Objective Competitive RL
Tiancheng Yu
Yi Tian
J.N. Zhang
S. Sra
26
20
0
05 Feb 2021
Inverse Constrained Reinforcement Learning
Inverse Constrained Reinforcement Learning
Usman Anwar
Shehryar Malik
Alireza Aghasi
Ali Ahmed
18
58
0
19 Nov 2020
Reward-Free Exploration for Reinforcement Learning
Reward-Free Exploration for Reinforcement Learning
Chi Jin
A. Krishnamurthy
Max Simchowitz
Tiancheng Yu
OffRL
112
194
0
07 Feb 2020
Blackwell Approachability and Low-Regret Learning are Equivalent
Blackwell Approachability and Low-Regret Learning are Equivalent
Jacob D. Abernethy
Peter L. Bartlett
Elad Hazan
86
117
0
08 Nov 2010
1