Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2202.02396
Cited By
A Temporal-Difference Approach to Policy Gradient Estimation
4 February 2022
Samuele Tosatto
Andrew Patterson
Martha White
A. R. Mahmood
OffRL
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Temporal-Difference Approach to Policy Gradient Estimation"
3 / 3 papers shown
Title
Analysis of On-policy Policy Gradient Methods under the Distribution Mismatch
Weizhen Wang
Jianping He
Xiaoming Duan
34
0
0
28 Mar 2025
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
334
1,951
0
04 May 2020
Off-Policy Actor-Critic
T. Degris
Martha White
R. Sutton
OffRL
CML
158
220
0
22 May 2012
1