ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1912.05109
  4. Cited By
Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement
  Learning

Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning

11 December 2019
Riashat Islam
Raihan Seraj
Samin Yeasar Arnob
Doina Precup
    OffRL
ArXiv (abs)PDFHTML

Papers citing "Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning"

3 / 3 papers shown
A Reinforcement Learning Method for Environments with Stochastic Variables: Post-Decision Proximal Policy Optimization with Dual Critic Networks
A Reinforcement Learning Method for Environments with Stochastic Variables: Post-Decision Proximal Policy Optimization with Dual Critic Networks
L. Felizardo
Edoardo Fadda
Paolo Brandimarte
E. Del-Moral-Hernandez
Mariá Cristina Vasconcelos Nascimento
OffRL
308
0
0
07 Apr 2025
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution Mismatch
Global Optimality and Finite Sample Analysis of Softmax Off-Policy Actor Critic under State Distribution MismatchJournal of machine learning research (JMLR), 2021
Shangtong Zhang
Rémi Tachet des Combes
Romain Laroche
405
16
0
04 Nov 2021
Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality
Doubly Robust Off-Policy Actor-Critic: Convergence and OptimalityInternational Conference on Machine Learning (ICML), 2021
Tengyu Xu
Zhuoran Yang
Zhaoran Wang
Yingbin Liang
OffRL
281
29
0
23 Feb 2021
1
Page 1 of 1