2012.13682
Cited By
POPO: Pessimistic Offline Policy Optimization
26 December 2020
Qiang He
Xinwen Hou
OffRL
Papers citing
"POPO: Pessimistic Offline Policy Optimization"
2 / 2 papers shown
Title
Adaptive Policy Learning for Offline-to-Online Reinforcement Learning
Han Zheng
Xufang Luo
Pengfei Wei
Xuan Song
Dongsheng Li
Jing Jiang
OffRL
OnRL
8
21
0
14 Mar 2023
MEPG: A Minimalist Ensemble Policy Gradient Framework for Deep Reinforcement Learning
Qiang He
Yuxun Qu
Chen Gong
Xinwen Hou
OffRL
14
10
0
22 Sep 2021