Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline

4 May 2024
Wenjia Meng
Qian Zheng
Long Yang
Yilong Yin
Gang Pan
    OffRL
arXiv: 2405.02572

Papers citing "Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline"

On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method
Junyu Zhang
Chengzhuo Ni
Zheng Yu
Csaba Szepesvári
Mengdi Wang
17 Feb 2021