ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2106.03934
  4. Cited By
Offline Policy Comparison under Limited Historical Agent-Environment
  Interactions

Offline Policy Comparison under Limited Historical Agent-Environment Interactions

7 June 2021
Anton Dereventsov
Joseph Daws
Clayton Webster
    OffRL
ArXivPDFHTML

Papers citing "Offline Policy Comparison under Limited Historical Agent-Environment Interactions"

3 / 3 papers shown
Title
Examining Policy Entropy of Reinforcement Learning Agents for
  Personalization Tasks
Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks
Anton Dereventsov
Andrew Starnes
Clayton Webster
18
4
0
21 Nov 2022
Discriminator-Weighted Offline Imitation Learning from Suboptimal
  Demonstrations
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations
Haoran Xu
Xianyuan Zhan
Honglei Yin
Huiling Qin
OffRL
24
66
0
20 Jul 2022
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on
  Open Problems
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
334
1,951
0
04 May 2020
1