Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2210.06766
Cited By
Policy Gradient With Serial Markov Chain Reasoning
13 October 2022
Edoardo Cetin
Oya Celiktutan
BDL
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Policy Gradient With Serial Markov Chain Reasoning"
2 / 2 papers shown
Title
Learning Pessimism for Robust and Efficient Off-Policy Reinforcement Learning
Edoardo Cetin
Oya Celiktutan
OffRL
32
16
0
07 Oct 2021
Iterative Amortized Policy Optimization
Joseph Marino
Alexandre Piché
Alessandro Davide Ialongo
Yisong Yue
OffRL
50
22
0
20 Oct 2020
1