ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2210.06766
  4. Cited By
Policy Gradient With Serial Markov Chain Reasoning

Policy Gradient With Serial Markov Chain Reasoning

13 October 2022
Edoardo Cetin
Oya Celiktutan
    BDL
    LRM
ArXivPDFHTML

Papers citing "Policy Gradient With Serial Markov Chain Reasoning"

2 / 2 papers shown
Title
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement
  Learning
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning
Edoardo Cetin
Oya Celiktutan
OffRL
32
16
0
07 Oct 2021
Iterative Amortized Policy Optimization
Iterative Amortized Policy Optimization
Joseph Marino
Alexandre Piché
Alessandro Davide Ialongo
Yisong Yue
OffRL
50
22
0
20 Oct 2020
1