2212.02125
TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets
5 December 2022
Yuanying Cai
Chuheng Zhang
Li Zhao
Wei Shen
Xuyun Zhang
Lei Song
Jiang Bian
Tao Qin
Tie-Yan Liu
OffRL
Papers citing
"TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets"
4 / 4 papers shown
Title
Cross DQN: Cross Deep Q Network for Ads Allocation in Feed
Guogang Liao
Zewen Wang
Xiaoxu Wu
Xiaowen Shi
Chuheng Zhang
Yongkang Wang
Xingxing Wang
Dong Wang
33
36
0
09 Sep 2021
Continuous Doubly Constrained Batch Reinforcement Learning
Rasool Fakoor
Jonas W. Mueller
Kavosh Asadi
Pratik Chaudhari
Alex Smola
OffRL
204
27
0
18 Feb 2021
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Seyed Kamyar Seyed Ghasemipour
Dale Schuurmans
S. Gu
OffRL
209
119
0
21 Jul 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
334
1,951
0
04 May 2020