arXiv:2208.00755
Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction Approach
1 August 2022
Baturay Saglam
Dogan C. Cicek
Furkan B. Mutlu
Suleyman Serdar Kozat
OffRL
OnRL
Papers citing
"Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction Approach"
3 / 3 papers shown
Compatible Gradient Approximations for Actor-Critic Algorithms
Baturay Saglam
Dionysis Kalogerias
29
0
0
02 Sep 2024
On the Reuse Bias in Off-Policy Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Dong Yan
Jun Zhu
OffRL
32
3
0
15 Sep 2022
SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning
Marco Bagatella
Sammy Christen
Otmar Hilliges
OffRL
24
5
0
26 May 2022