ResearchTrend.AI
Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk

9 June 2022
Chengyang Ying
Xinning Zhou
Hang Su
Dong Yan
Ning Chen
Jun Zhu

Papers citing "Towards Safe Reinforcement Learning via Constraining Conditional Value-at-Risk"

28 / 28 papers shown
Bridging Econometrics and AI: VaR Estimation via Reinforcement Learning and GARCH Models
Fredy Pokou
Jules Sadefo Kamdem
François Benhmad
AIFin
34
0
0
23 Apr 2025
Measures of Variability for Risk-averse Policy Gradient
Yudong Luo
Yangchen Pan
Jiaqi Tan
Pascal Poupart
40
0
0
15 Apr 2025
Task Aware Dreamer for Task Generalization in Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Songming Liu
Dong Yan
Jun Zhu
66
3
0
17 Feb 2025
Risk-averse policies for natural gas futures trading using distributional reinforcement learning
Félicien Hêche
Biagio Nigro
Oussama Barakat
Stephan Robert-Nicoud
OffRL
39
0
0
08 Jan 2025
The Perfect Blend: Redefining RLHF with Mixture of Judges
Tengyu Xu
Eryk Helenowski
Karthik Abinav Sankararaman
Di Jin
Kaiyan Peng
...
Gabriel Cohen
Yuandong Tian
Hao Ma
Sinong Wang
Han Fang
35
9
0
30 Sep 2024
Handling Long-Term Safety and Uncertainty in Safe Reinforcement Learning
Jonas Günster
Puze Liu
Jan Peters
Davide Tateo
OffRL
23
2
0
18 Sep 2024
Revisiting Safe Exploration in Safe Reinforcement learning
David Eckel
Baohe Zhang
Joschka Bödecker
44
0
0
02 Sep 2024
Bridging the gap between Learning-to-plan, Motion Primitives and Safe Reinforcement Learning
Piotr Kicki
Davide Tateo
Puze Liu
Jonas Guenster
Jan Peters
Krzysztof Walas
31
2
0
26 Aug 2024
Bootstrapping Expectiles in Reinforcement Learning
Pierre Clavier
Emmanuel Rachelson
E. L. Pennec
Matthieu Geist
OffRL
38
0
0
06 Jun 2024
Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees
Dohyeong Kim
Taehyun Cho
Seung Han
Hojun Chung
Kyungjae Lee
Songhwai Oh
34
0
0
29 May 2024
PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Xuezhou Xu
Hang Su
Xingxing Zhang
Jun Zhu
32
4
0
23 May 2024
Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk
Xinyi Ni
Lifeng Lai
44
0
0
02 May 2024
Safe Reinforcement Learning on the Constraint Manifold: Theory and Applications
Puze Liu
Haitham Bou-Ammar
Jan Peters
Davide Tateo
38
7
0
13 Apr 2024
Constrained Reinforcement Learning with Smoothed Log Barrier Function
Baohe Zhang
Yuan Zhang
Lilli Frison
Thomas Brox
Joschka Bödecker
35
8
0
21 Mar 2024
A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
Yudong Luo
Yangchen Pan
Han Wang
Philip H. S. Torr
Pascal Poupart
34
3
0
17 Mar 2024
Enhancing LLM Safety via Constrained Direct Preference Optimization
Zixuan Liu
Xiaolin Sun
Zizhan Zheng
41
20
0
04 Mar 2024
TRC: Trust Region Conditional Value at Risk for Safe Reinforcement Learning
Dohyeong Kim
Songhwai Oh
14
19
0
01 Dec 2023
Efficient Off-Policy Safe Reinforcement Learning Using Trust Region Conditional Value at Risk
Dohyeong Kim
Songhwai Oh
OffRL
12
19
0
01 Dec 2023
Adjustable Robust Reinforcement Learning for Online 3D Bin Packing
Yuxin Pan
Yize Chen
Fangzhen Lin
OffRL
43
9
0
06 Oct 2023
Extreme Risk Mitigation in Reinforcement Learning using Extreme Value Theory
NS Karthik Somayaji
Yu Wang
M. Schram
Ján Drgoňa
M. Halappanavar
Frank Liu
Peng Li
17
0
0
24 Aug 2023
Learning Diverse Risk Preferences in Population-based Self-play
Y. Jiang
Qihan Liu
Xiaoteng Ma
Chenghao Li
Yiqin Yang
Jun Yang
Bin Liang
Qianchuan Zhao
54
5
0
19 May 2023
Risk Sensitive Dead-end Identification in Safety-Critical Offline Reinforcement Learning
Taylor W. Killian
S. Parbhoo
Marzyeh Ghassemi
OffRL
18
6
0
13 Jan 2023
SoK: Adversarial Machine Learning Attacks and Defences in Multi-Agent Reinforcement Learning
Maxwell Standen
Junae Kim
Claudia Szabo
AAML
29
5
0
11 Jan 2023
On the Reuse Bias in Off-Policy Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Dong Yan
Jun Zhu
OffRL
32
3
0
15 Sep 2022
Robust Reinforcement Learning with Distributional Risk-averse formulation
Pierre Clavier
S. Allassonnière
E. L. Pennec
OOD
31
7
0
14 Jun 2022
Consistent Attack: Universal Adversarial Perturbation on Embodied Vision Navigation
Chengyang Ying
You Qiaoben
Xinning Zhou
Hang Su
Wenbo Ding
Jianyong Ai
AAML
16
11
0
12 Jun 2022
Understanding Adversarial Attacks on Observations in Deep Reinforcement Learning
You Qiaoben
Chengyang Ying
Xinning Zhou
Hang Su
Jun Zhu
Bo Zhang
AAML
25
14
0
30 Jun 2021
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach
Yinlam Chow
Aviv Tamar
Shie Mannor
Marco Pavone
67
310
0
06 Jun 2015