ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2208.00755
  4. Cited By
Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step
  Q-learning: A Novel Correction Approach

Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction Approach

1 August 2022
Baturay Saglam
Dogan C. Cicek
Furkan B. Mutlu
Suleyman Serdar Kozat
    OffRL
    OnRL
ArXivPDFHTML

Papers citing "Mitigating Off-Policy Bias in Actor-Critic Methods with One-Step Q-learning: A Novel Correction Approach"

3 / 3 papers shown
Title
Compatible Gradient Approximations for Actor-Critic Algorithms
Compatible Gradient Approximations for Actor-Critic Algorithms
Baturay Saglam
Dionysis Kalogerias
29
0
0
02 Sep 2024
On the Reuse Bias in Off-Policy Reinforcement Learning
On the Reuse Bias in Off-Policy Reinforcement Learning
Chengyang Ying
Zhongkai Hao
Xinning Zhou
Hang Su
Dong Yan
Jun Zhu
OffRL
32
3
0
15 Sep 2022
SFP: State-free Priors for Exploration in Off-Policy Reinforcement
  Learning
SFP: State-free Priors for Exploration in Off-Policy Reinforcement Learning
Marco Bagatella
Sammy Christen
Otmar Hilliges
OffRL
24
5
0
26 May 2022
1