2504.20887
Return Capping: Sample-Efficient CVaR Policy Gradient Optimisation
29 April 2025
Harry Mead
Clarissa Costen
Bruno Lacerda
Nick Hawes
Papers citing
"Return Capping: Sample-Efficient CVaR Policy Gradient Optimisation"
22 / 22 papers shown
Policy Optimization Prefers The Path of Least Resistance
Debdeep Sanyal
Aakash Sen Sharma
Dhruv Kumar
Saurabh Deshpande
Murari Mandal
167
1
0
22 Oct 2025
Risk-Aware Reinforcement Learning with Bandit-Based Adaptation for Quadrupedal Locomotion
Yuanhong Zeng
Anushri Dixit
OffRL
101
0
0
16 Oct 2025
Ergodic Risk Measures: Towards a Risk-Aware Foundation for Continual Reinforcement Learning
Juan Sebastian Rojas
Chi-Guhn Lee
150
0
0
03 Oct 2025
Gymnasium: A Standard Interface for Reinforcement Learning Environments
Mark Towers
Ariel Kwiatkowski
Jordan Terry
John U. Balis
Gianluca De Cola
...
Andrea Pierré
Sander Schulhoff
Jun Jet Tai
Hannah Tan
Omar G. Younis
AuLLM
OffRL
416
501
0
24 Jul 2024
A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Yudong Luo
Yangchen Pan
Han Wang
Juil Sock
Pascal Poupart
356
5
0
17 Mar 2024
Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning
IEEE International Conference on Robotics and Automation (ICRA), 2023
Lukas Schneider
Jonas Frey
Takahiro Miki
Marco Hutter
322
20
0
25 Sep 2023
An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient
Neural Information Processing Systems (NeurIPS), 2023
Yudong Luo
Guiliang Liu
Pascal Poupart
Yangchen Pan
363
13
0
17 Jul 2023
Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving
Annual Conference of the IEEE Industrial Electronics Society (IECON), 2023
Linjin Wu
Zengjie Zhang
S. Haesaert
Zhiqiang Ma
Zhiyong Sun
OffRL
180
8
0
05 Jun 2023
Efficient Risk-Averse Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2022
Ido Greenberg
Yinlam Chow
Mohammad Ghavamzadeh
Shie Mannor
288
53
0
10 May 2022
Risk-Averse Bayes-Adaptive Reinforcement Learning
Neural Information Processing Systems (NeurIPS), 2021
Marc Rigter
Bruno Lacerda
Nick Hawes
198
45
0
10 Feb 2021
Worst Cases Policy Gradients
Conference on Robot Learning (CoRL), 2019
Yichuan Tang
Jian Zhang
Ruslan Salakhutdinov
169
81
0
09 Nov 2019
Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy
AAAI Conference on Artificial Intelligence (AAAI), 2019
Ramtin Keramati
Christoph Dann
Alex Tamkin
Emma Brunskill
320
84
0
05 Nov 2019
Risk Averse Robust Adversarial Reinforcement Learning
Xinlei Pan
Daniel Seita
Yang Gao
John F. Canny
AAML
152
107
0
31 Mar 2019
Implicit Quantile Networks for Distributional Reinforcement Learning
Will Dabney
Georg Ostrovski
David Silver
Rémi Munos
OffRL
332
621
0
14 Jun 2018
A Distributional Perspective on Reinforcement Learning
Marc G. Bellemare
Will Dabney
Rémi Munos
OffRL
304
1,714
0
21 Jul 2017
Proximal Policy Optimization Algorithms
John Schulman
Filip Wolski
Prafulla Dhariwal
Alec Radford
Oleg Klimov
OffRL
1.3K
24,405
0
20 Jul 2017
Robust Adversarial Reinforcement Learning
Lerrel Pinto
James Davidson
Rahul Sukthankar
Abhinav Gupta
OOD
320
972
0
08 Mar 2017
OpenAI Gym
Greg Brockman
Vicki Cheung
Ludwig Pettersson
Jonas Schneider
John Schulman
Jie Tang
Wojciech Zaremba
OffRL
ODL
578
5,414
0
05 Jun 2016
Risk-Constrained Reinforcement Learning with Percentile Risk Criteria
Yinlam Chow
Mohammad Ghavamzadeh
Lucas Janson
Marco Pavone
354
583
0
05 Dec 2015
Contextual Markov Decision Processes
Assaf Hallak
Dotan Di Castro
Shie Mannor
384
286
0
08 Feb 2015
Optimizing the CVaR via Sampling
AAAI Conference on Artificial Intelligence (AAAI), 2014
Aviv Tamar
Yonatan Glassner
Shie Mannor
478
203
0
15 Apr 2014
Variance-Constrained Actor-Critic Algorithms for Discounted and Average Reward MDPs
Machine-mediated learning (ML), 2014
Prashanth L.A.
Mohammad Ghavamzadeh
271
73
0
25 Mar 2014