ResearchTrend.AI
TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets

5 December 2022
Yuanying Cai
Chuheng Zhang
Li Zhao
Wei Shen
Xuyun Zhang
Lei Song
Jiang Bian
Tao Qin
Tie-Yan Liu
    OffRL

Papers citing "TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets"

4 / 4 papers shown
Cross DQN: Cross Deep Q Network for Ads Allocation in Feed
Guogang Liao
Zewen Wang
Xiaoxu Wu
Xiaowen Shi
Chuheng Zhang
Yongkang Wang
Xingxing Wang
Dong Wang
33
36
0
09 Sep 2021
Continuous Doubly Constrained Batch Reinforcement Learning
Rasool Fakoor
Jonas W. Mueller
Kavosh Asadi
Pratik Chaudhari
Alex Smola
OffRL
204
27
0
18 Feb 2021
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
Seyed Kamyar Seyed Ghasemipour
Dale Schuurmans
S. Gu
OffRL
209
119
0
21 Jul 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
334
1,951
0
04 May 2020