Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2101.02137
Cited By
Smoothed functional-based gradient algorithms for off-policy reinforcement learning: A non-asymptotic viewpoint
6 January 2021
Nithia Vijayan
A. PrashanthL.
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Smoothed functional-based gradient algorithms for off-policy reinforcement learning: A non-asymptotic viewpoint"
2 / 2 papers shown
Title
Online Estimation and Optimization of Utility-Based Shortfall Risk
Vishwajit Hegde
Arvind S. Menon
L. A. Prashanth
Krishna Jagannathan
11
2
0
16 Nov 2021
Off-Policy Actor-Critic
T. Degris
Martha White
R. Sutton
OffRL
CML
158
220
0
22 May 2012
1