ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1911.03618
  4. Cited By
Worst Cases Policy Gradients

Worst Cases Policy Gradients

9 November 2019
Yichuan Tang
Jian Zhang
Ruslan Salakhutdinov
ArXivPDFHTML

Papers citing "Worst Cases Policy Gradients"

25 / 25 papers shown
Title
Return Capping: Sample-Efficient CVaR Policy Gradient Optimisation
Return Capping: Sample-Efficient CVaR Policy Gradient Optimisation
Harry Mead
Clarissa Costen
Bruno Lacerda
Nick Hawes
24
0
0
29 Apr 2025
Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies
Ensemble RL through Classifier Models: Enhancing Risk-Return Trade-offs in Trading Strategies
Zheli Xiong
49
0
0
23 Feb 2025
DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems
DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems
Se-Wook Yoo
Seung-Woo Seo
55
0
0
30 Jan 2025
Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence
Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence
Minheng Xiao
Xian Yu
Lei Ying
40
2
0
23 May 2024
RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer
  Crashes
RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes
Kyle Stachowicz
Sergey Levine
17
6
0
07 May 2024
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User
  Experiences in Recommender Systems
UOEP: User-Oriented Exploration Policy for Enhancing Long-Term User Experiences in Recommender Systems
Changshuo Zhang
Sirui Chen
Xiao Zhang
Sunhao Dai
Weijie Yu
Jun Xu
OffRL
35
1
0
17 Jan 2024
TRC: Trust Region Conditional Value at Risk for Safe Reinforcement
  Learning
TRC: Trust Region Conditional Value at Risk for Safe Reinforcement Learning
Dohyeong Kim
Songhwai Oh
19
19
0
01 Dec 2023
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
On the Theory of Risk-Aware Agents: Bridging Actor-Critic and Economics
Michal Nauman
Marek Cygan
35
1
0
30 Oct 2023
Learning Risk-Aware Quadrupedal Locomotion using Distributional
  Reinforcement Learning
Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning
Lukas Schneider
Jonas Frey
Takahiro Miki
Marco Hutter
30
9
0
25 Sep 2023
Is Risk-Sensitive Reinforcement Learning Properly Resolved?
Is Risk-Sensitive Reinforcement Learning Properly Resolved?
Ruiwen Zhou
Minghuan Liu
Kan Ren
Xufang Luo
Weinan Zhang
Dongsheng Li
27
2
0
02 Jul 2023
Risk-Sensitive Policy with Distributional Reinforcement Learning
Risk-Sensitive Policy with Distributional Reinforcement Learning
Thibaut Théate
D. Ernst
OffRL
30
5
0
30 Dec 2022
Characterising the Robustness of Reinforcement Learning for Continuous
  Control using Disturbance Injection
Characterising the Robustness of Reinforcement Learning for Continuous Control using Disturbance Injection
Catherine R. Glossop
Jacopo Panerati
A. Krishnan
Zhaocong Yuan
Angela P. Schoellig
22
6
0
27 Oct 2022
Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments
  with Distributional Reinforcement Learning
Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement Learning
Cheng Liu
E. Kampen
Guido de Croon
31
16
0
28 Mar 2022
Distributional Reinforcement Learning for Scheduling of Chemical
  Production Processes
Distributional Reinforcement Learning for Scheduling of Chemical Production Processes
M. Mowbray
Dongda Zhang
Ehecatl Antonio del Rio Chanona
OffRL
25
6
0
01 Mar 2022
Learning to Be Cautious
Learning to Be Cautious
Montaser Mohammedalamen
Dustin Morrill
Alexander Sieusahai
Yash Satsangi
Michael Bowling
18
3
0
29 Oct 2021
How to Certify Machine Learning Based Safety-critical Systems? A
  Systematic Literature Review
How to Certify Machine Learning Based Safety-critical Systems? A Systematic Literature Review
Florian Tambon
Gabriel Laberge
Le An
Amin Nikanjam
Paulina Stevia Nouwou Mindom
Y. Pequignot
Foutse Khomh
G. Antoniol
E. Merlo
François Laviolette
30
66
0
26 Jul 2021
Conservative Offline Distributional Reinforcement Learning
Conservative Offline Distributional Reinforcement Learning
Yecheng Jason Ma
Dinesh Jayaraman
Osbert Bastani
OffRL
70
78
0
12 Jul 2021
Lyapunov Barrier Policy Optimization
Lyapunov Barrier Policy Optimization
Harshit S. Sikchi
Wenxuan Zhou
David Held
26
14
0
16 Mar 2021
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Maximum Entropy RL (Provably) Solves Some Robust RL Problems
Benjamin Eysenbach
Sergey Levine
OOD
41
175
0
10 Mar 2021
Risk-Averse Bayes-Adaptive Reinforcement Learning
Risk-Averse Bayes-Adaptive Reinforcement Learning
Marc Rigter
Bruno Lacerda
Nick Hawes
24
41
0
10 Feb 2021
Soft-Robust Algorithms for Batch Reinforcement Learning
Soft-Robust Algorithms for Batch Reinforcement Learning
Elita Lobo
Mohammad Ghavamzadeh
Marek Petrik
OffRL
28
4
0
30 Nov 2020
Learning to be Safe: Deep RL with a Safety Critic
Learning to be Safe: Deep RL with a Safety Critic
K. Srinivasan
Benjamin Eysenbach
Sehoon Ha
Jie Tan
Chelsea Finn
OffRL
33
141
0
27 Oct 2020
One Solution is Not All You Need: Few-Shot Extrapolation via Structured
  MaxEnt RL
One Solution is Not All You Need: Few-Shot Extrapolation via Structured MaxEnt RL
Saurabh Kumar
Aviral Kumar
Sergey Levine
Chelsea Finn
OffRL
16
90
0
27 Oct 2020
Bayesian Robust Optimization for Imitation Learning
Bayesian Robust Optimization for Imitation Learning
Daniel S. Brown
S. Niekum
Marek Petrik
27
32
0
24 Jul 2020
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach
Risk-Sensitive and Robust Decision-Making: a CVaR Optimization Approach
Yinlam Chow
Aviv Tamar
Shie Mannor
Marco Pavone
73
312
0
06 Jun 2015
1