Communities
Connect sessions
AI calendar
Organizations
Contact Sales
Search
Open menu
Home
Papers
1911.01546
Cited By
v1
v2 (latest)
Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy
5 November 2019
Ramtin Keramati
Christoph Dann
Alex Tamkin
Emma Brunskill
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy"
50 / 53 papers shown
Title
Optimistic Exploration for Risk-Averse Constrained Reinforcement Learning
J. McCarthy
Radu Marinescu
Elizabeth M. Daly
Ivana Dusparic
82
0
0
11 Jul 2025
Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL
Zhikun Tao
Gang Xiong
He Fang
Zhen Shen
Yunjun Han
Qing-Shan Jia
OffRL
222
0
0
13 May 2025
Return Capping: Sample-Efficient CVaR Policy Gradient Optimisation
Harry Mead
Clarissa Costen
Bruno Lacerda
Nick Hawes
213
1
0
29 Apr 2025
Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement Learning
Mehrdad Moghimi
Hyejin Ku
OffRL
182
1
0
03 Jan 2025
Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis
J. Hau
Erick Delage
Esther Derman
Mohammad Ghavamzadeh
Marek Petrik
94
2
0
31 Oct 2024
Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processes
Juan Sebastian Rojas
Chi-Guhn Lee
154
2
0
14 Oct 2024
Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence
Minheng Xiao
Xian Yu
Lei Ying
187
2
0
23 May 2024
Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk
Xinyi Ni
Lifeng Lai
103
3
0
02 May 2024
A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Yudong Luo
Yangchen Pan
Han Wang
Juil Sock
Pascal Poupart
179
3
0
17 Mar 2024
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation
Yu Chen
Xiangcheng Zhang
Siwei Wang
Longbo Huang
154
3
0
28 Feb 2024
A unified uncertainty-aware exploration: Combining epistemic and aleatory uncertainty
Parvin Malekzadeh
Ming Hou
Konstantinos N. Plataniotis
UD
115
3
0
05 Jan 2024
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
Jinyi Liu
Zhi Wang
Yan Zheng
Jianye Hao
Chenjia Bai
Junjie Ye
Zhen Wang
Haiyin Piao
Yang Sun
166
12
0
19 Dec 2023
Modeling Risk in Reinforcement Learning: A Literature Mapping
Leonardo Villalobos-Arias
Derek Martin
Abhijeet Krishnan
Madeleine Gagné
Colin M. Potts
Arnav Jhala
105
0
0
08 Dec 2023
Pitfall of Optimism: Distributional Reinforcement Learning by Randomizing Risk Criterion
Taehyun Cho
Seung Han
Heesoo Lee
Kyungjae Lee
Jungwoo Lee
145
6
0
25 Oct 2023
DRL-ORA: Distributional Reinforcement Learning with Online Risk Adaption
Yupeng Wu
Wenyun Li
Wenjie Huang
Chin Pang Ho
OffRL
131
0
0
08 Oct 2023
Distributional Off-Policy Evaluation for Slate Recommendations
Shreyas Chaudhari
David Arbour
Georgios Theocharous
N. Vlassis
OffRL
113
0
0
27 Aug 2023
Value-Distributional Model-Based Reinforcement Learning
Carlos E. Luis
A. Bottero
Julia Vinogradska
Felix Berkenkamp
Jan Peters
OffRL
98
6
0
12 Aug 2023
Cramer Type Distances for Learning Gaussian Mixture Models by Gradient Descent
Ruichong Zhang
85
0
0
13 Jul 2023
Is Risk-Sensitive Reinforcement Learning Properly Resolved?
Ruiwen Zhou
Minghuan Liu
Kan Ren
Xufang Luo
Weinan Zhang
Dongsheng Li
88
3
0
02 Jul 2023
A Distribution Optimization Framework for Confidence Bounds of Risk Measures
Hao Liang
Zhimin Luo
92
3
0
12 Jun 2023
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning
Kaiwen Wang
Kevin Zhou
Runzhe Wu
Nathan Kallus
Wen Sun
OffRL
188
21
0
25 May 2023
On Dynamic Programming Decompositions of Static Risk Measures in Markov Decision Processes
J. Hau
Erick Delage
Mohammad Ghavamzadeh
Marek Petrik
205
11
0
24 Apr 2023
Robust Route Planning with Distributional Reinforcement Learning in a Stochastic Road Network Environment
Xi Lin
Paul Szenher
John D. Martin
Brendan Englot
100
2
0
19 Apr 2023
Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning
Ji-Yun Oh
Joonkee Kim
Minchan Jeong
Se-Young Yun
123
1
0
03 Mar 2023
Forward-PECVaR Algorithm: Exact Evaluation for CVaR SSPs
Willy Arthur Silva Reis
D. B. Pais
Valdinei Freire
K. V. Delgado
72
0
0
01 Mar 2023
Distributional Offline Policy Evaluation with Predictive Error Guarantees
Runzhe Wu
Masatoshi Uehara
Wen Sun
OffRL
113
17
0
19 Feb 2023
Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR
Kaiwen Wang
Nathan Kallus
Wen Sun
209
25
0
07 Feb 2023
Risk-Averse Model Uncertainty for Distributionally Robust Safe Reinforcement Learning
James Queeney
M. Benosman
OOD
OffRL
150
9
0
30 Jan 2023
One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement Learning
Marc Rigter
Bruno Lacerda
Nick Hawes
OffRL
212
9
0
30 Nov 2022
Bridging Distributional and Risk-sensitive Reinforcement Learning with Provable Regret Bounds
Hao Liang
Zhihui Luo
149
16
0
25 Oct 2022
Regret Bounds for Risk-Sensitive Reinforcement Learning
Osbert Bastani
Y. Ma
E. Shen
Wei Xu
97
20
0
11 Oct 2022
Off-Policy Risk Assessment in Markov Decision Processes
Audrey Huang
Liu Leqi
Zachary Chase Lipton
Kamyar Azizzadenesheli
OffRL
172
8
0
21 Sep 2022
Enforcing Delayed-Impact Fairness Guarantees
Aline Weber
Blossom Metevier
Yuriy Brun
Philip S. Thomas
Bruno Castro da Silva
FaML
222
11
0
24 Aug 2022
Distributional Actor-Critic Ensemble for Uncertainty-Aware Continuous Control
T. Kanazawa
Haiyan Wang
Chetan Gupta
UQCV
124
5
0
27 Jul 2022
Conformal Off-Policy Prediction in Contextual Bandits
Muhammad Faaiz Taufiq
Jean-François Ton
R. Cornish
Yee Whye Teh
Arnaud Doucet
OffRL
231
26
0
09 Jun 2022
Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path
Yihan Du
Siwei Wang
Longbo Huang
OOD
114
15
0
06 Jun 2022
The Sufficiency of Off-Policyness and Soft Clipping: PPO is still Insufficient according to an Off-Policy Measure
Xing Chen
Dongcui Diao
Hechang Chen
Hengshuai Yao
Haiyin Piao
Zhixiao Sun
Zhiwei Yang
Randy Goebel
Bei Jiang
Yi-Ju Chang
OffRL
266
17
0
20 May 2022
Efficient Risk-Averse Reinforcement Learning
Ido Greenberg
Yinlam Chow
Mohammad Ghavamzadeh
Shie Mannor
124
49
0
10 May 2022
Risk-aware Stochastic Shortest Path
Tobias Meggendorfer
69
11
0
03 Mar 2022
Two steps to risk sensitivity
Christian Gagné
Peter Dayan
83
12
0
12 Nov 2021
Planning for Risk-Aversion and Expected Value in MDPs
Marc Rigter
Paul Duckworth
Bo Li
Shouhong Ding
103
11
0
25 Oct 2021
Motion Planning for Autonomous Vehicles in the Presence of Uncertainty Using Reinforcement Learning
K. Rezaee
Peyman Yadmellat
Simón Chamorro
60
22
0
01 Oct 2021
ACReL: Adversarial Conditional value-at-risk Reinforcement Learning
Mathieu Godbout
M. Heuillet
Sharath Chandra
R. Bhati
Audrey Durand
140
1
0
20 Sep 2021
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
Dinesh Jayaraman
Osbert Bastani
OffRL
140
89
0
12 Jul 2021
Automatic Risk Adaptation in Distributional Reinforcement Learning
Frederik Schubert
Theresa Eimer
Bodo Rosenhahn
Marius Lindauer
76
8
0
11 Jun 2021
Universal Off-Policy Evaluation
Yash Chandak
S. Niekum
Bruno C. da Silva
Erik Learned-Miller
Emma Brunskill
Philip S. Thomas
OffRL
ELM
131
55
0
26 Apr 2021
Off-Policy Risk Assessment in Contextual Bandits
Audrey Huang
Liu Leqi
Zachary Chase Lipton
Kamyar Azizzadenesheli
OffRL
112
40
0
18 Apr 2021
Lyapunov Barrier Policy Optimization
Harshit S. Sikchi
Wenxuan Zhou
David Held
116
15
0
16 Mar 2021
On the Convergence and Optimality of Policy Gradient for Markov Coherent Risk
Audrey Huang
Liu Leqi
Zachary Chase Lipton
Kamyar Azizzadenesheli
125
23
0
04 Mar 2021
RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents
Wei Qiu
Xinrun Wang
Runsheng Yu
Xu He
Rongpin Wang
Bo An
S. Obraztsova
Zinovi Rabinovich
111
53
0
16 Feb 2021
1
2
Next